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GLYCOSIDASE ENZYMES 
BACKGROUND OF THE INVENTION 

1. Field of the Inventions 

This invention relates to. newly identified polynucleotides, polypeptides encoded by 
such polynucleotides, the use of such polynucleotides and ^ polypeptides, as well as the' 
production and isolation of such polynucleotides and polypeptides. More particularly, the 
polynucleotides and polypeptides of the present invention has been putatively identified as 
glucosidases, a-galactosidases, p-galactosidases, 8-mannosidases, B-mannanases, 
endoglucanases, arid pullalanases. 

2. Description of Related Art 

' The glycosidic bond of p-galactosides can be cleaved by different classes of 
enzymes: - (i) phospho-P-galactosidases (EC3.2.1.85) are specific for a phosphorylated 
substrate generated via phosphoenolpyruvate phosphotransferase system (PTS)-dependent 
uptake; (ii) typical p-galactosidases (EC 3,2.1.23), represented by the Escherichia coli LacZ 
enzyme, which are relatively specific for P-galactosides; and (iii) P-glucosidases (EC 
3.2.1.21) such as the enzymes of Agrobacterium faecalis, Clostridium thermocellum, 
Pyrococcus furiosus or Sulfolobus solfataricus (Day, A.G. and Withers,- S.G.,(1986) 
Purification and characterization of a P-gliicosidase from Alcaligenes faecalis: Can. J. 
Biochem. Cell. Biol. 64, 914-922; Kengen,-S.W.M., et-al. (1993) Eur. J. Biochem., 213, 
305-3 12; Ait, N., Cruezet, N. and Cattaneo, J. (1 982) Properties of p-glucosidase purified 
from Clostridium thermocellum. J. Gen. Microbiol. 128,. 569-577; Grogan, D.W. (1991) 
Evidence that P-galactosidase of Sulfolobus solfataricus is only one of several activities of 
a thermostable P-D-glycodiase. Appl. Environ. Microbiol. 57, 1644-1649). Members of 
the latter group, although highly specific with respect to the p-anomeric configuration of 
the glycosidic linkage, often display a rather relaxed'substrate-specificity and hydrolyze p- 
glucosides as well as P-fucosides and P-galactosides. 

1 
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Generally, a-galactosidases are enzymes that catalyze the hydrolysis of galactose 
groups on a polysaccharide backbone or hydrolyze the cleavage of di- or oligosaccharides 

comprising galactose. 1 ' ■ r 

Generally, B-mannanases are enzymes that catalyze the hydrolysis of mannose 
groups internally on a polysaccharide backbone or hydrolyze the cleavage of'di- or 
oligosaccaharides comprising-mannose groups. BTmannosidases.hydrolyze;non-reducing, 
terminal mannose residues on a mannose-containing polysaccharide and the cleavage of di- 
or oligosaccaharides comprising mannose groups. . ^. ^ r . - 

Guar gum is a branched galactomannan polysaccharide composed of P-1,4 linked 
mannose backbone with a- 1,6 linked galactose side chains. The enzymes required for. the 
degradation of guar are P-mannanase, P-mannosidase and a-galactpsidase. p-mannanase 
hydrolyses the mannose backbone internally and P-mannosidase hydrolyses rion T reducing. 
terminal mannose. residues, a-galactosidase hydrolyses a-linked galactose groups. 

Galactomannan polysaccharides and the enzymes that degrade them have a variety , 
of applications. Guar is commonly used as a thickening agent in food and is utilized in 
hydraulic fracturing in oil and gas recovery. Consequently, galactomannanases are 
industrially relevant for the degradation and modification of guar. Furthermore, a need 
exists for thermostable galactomannases that are active in extreme conditions associated 
with drilling and well stimulation. . 

There are other applications for these enzymes in; various industries, such as in the 
beet sugar industry: 20-30% of the domestic U.S. sucrose consumption is sucrose from 
sugar beets. Raw beet sugar can contain a small amount of raffmose when the sugar beets 
are stored before processing and rotting begins to. set in. Raffmose inhibits the 
crystallization of sucrose and also constitutes a hidden quantity of sucrose. Thus, there is 
merit to eliminating raffmose from raw beet sugar. cc-Galactosidase has; also been used as 
a digestive aid to break down raffmose, stachyose, and verbascose in such foods as beans 

and other gassy foods. 

p-galactosidases which are active and stable at high temperatures appear to be 
superior enzymes for the production of lactose-free dietary milk products (Chaplin, M.F. 
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and Bucke, C. (1990) In: Enzyme Technology, pp. 159-160, Cambridge University Press, 
Cambridge, UK). Also, several studies have demonstrated the applicability of 0- 
galactosidases *o the enzymatic synthesis of oligosaccharides via, transglycosylation 
reactions (Nilsson, K.G.I. (1988) Enzymatic synthesis of oligosaccharides. Trends 
Biotechnol. 6, 156-264; Cote, G.L. and Tao, B.Y. (1990) Oligosaccharide synthesis by 
enzymatic transglycosylation. Glycoconjugate J. 7, 145-162). Despite the commercial 
potential/only a few P-galactosidases of thermophiles have been characterized so far. Two 
genes reported are P-galactqside-cleaving enzymes of the hyperthermophilic bacterium 
Thermotoga maritima, one of the most thermophilic organotrophic eubacteria described to 
date (Huber, R. r Langworthy, T.A., Konig, H., Thomm, M, Woese f C.R.; Sleytr, U!B. and 
S tetter, K.O. (1986) T. martima sp. nov. represents a -new genus- of unique extremely 
thermophilic eubacteria growing up to 90°C, Arch. Microbiol. 144, 324-333) one, of the 
most thermophilic organotrophic eubacteria described to date. The gene products have been 
identified as a P-galactosidase and a P-glucosidase. 

Pullulanase is well known as a debranching enzyme of pullulan and starch. The 
enzyme hydrolyzes a-l,6-glucosidic linkages on these polymers. Starch degradation for 
the production or sweeteners (glucose or maltose) is a very important industrial application 
of this enzyme. The degradation of starch -is. developed in two stages. The first stage 
involves the liquefaction of the substrate with a-amylase, and the second stage, or 
saccharification stage, is performed by fl-amylase 1 with pullalanase added as a debranching 
enzyme, to obtain better yields. 

Endoglucanases can be used in a variety of industrial applications. For instance, the 
endoglucanases of the present invention can hydrolyze the internal B-l,4-glycosidic bonds 
in cellulose, which may be used for the conversion of plant biomass into fuels, and 
chemicals. Endoglucanases also have applications in detergent formulations, the textile 
industry, in animal feed, in .waste treatment, and. in the fruit juice and brewing industry for 
the clarification and extraction of juices. 
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Brief Description of the Drawings 

The following drawings are illustrative of embodiments of the invention and are not 
meant to limit the scope of the invention as encompassed by the claims. 

< Figures la-b : are the full-length,' DN A and corresponding deduced amino acid 
sequence, of Mil TL of the present invention. Sequencing was performed using a 378 
automated DNA sequencer for all sequences of the-present invention (Applied Biosystems, 

Inc.). ; . . \ V : ■ ■ ; " 

~ : ' . Figure 2 is an illustration of the full-length DNA and corresponding deduced amino" " 
acid sequence of OC1/4V-33B/G. . ' '' V\ ' 

Figure 3 is an illustration of the full-length DNA and corresponding deduced amino 
acid sequence of F 1-1 2G. " ' . . ^ " / ' 

; Figures 4a-b are the full-length DNA; and corresponding deduced amino acid ' 
' sequence of,9N2-31B/G. _ . . • . ■ 

Figures 5a-b are- the full-length DNA and corresponding, deduced amino acid 
sequence of MSB8-6G. • ( ' 

Figure 6 . is the full-length DNA and corresponding deduced amino acid sequence 

of AEDII 1 2RA- 1 8 B/G . 7 .' ■ v '"- ; 

' Figures 7a-b are the full-length DNA and '.corresponding deduced amino acid 

sequence of GC74-22G. . ^ ■ - 

• .. Figures 8a-a are : the full-length DNA and corresponding deduced amino; acid 
sequence of VC1-7G1. 

Figures 9a-c are the fullrlength DNA and corresponding deduced amino acid 

- sequence of 37GP1.< , , 7 ■'• 

Figures l'Qa-c are the full-length DNA and corresponding deduced amino acid 
sequence of 6GC2. ;, v 1 

Figures lla-d are the full-length DNA and corresponding deduced amino acid 
sequence of 6GP2. 

Figures 12a-c are the full-length DNA and corresponding deduced amino acid 
sequence of 63GB1. 

4 . 
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Figures 13a-b are the full-length DNA and corresponding deduced amino acid 
sequence of OC 1/4 V. 

Figures 14a-e are the full-length DNA and corresponding deduced amino acid 
sequence of 6GP3. 

Figures 15a-d are the full-length DNA and corresponding deduced amino acid 

sequence of Thermotoga maritima MSB8-6GP2. 

Figures 16a-c are the full-length DNA and corresponding deduced amino acid 
sequence of Thermotoga maritima MSB8-6GB4. 

Figures 17a-d are the full-length DNA and corresponding deduced amino acid 
sequence of.Banki gouldi 37GP4. 

Figures 18a-b are the full-length DNA and corresponding deduced amino acid 
sequence of Pyrococcus ftiriosus VC1-7EG1. 

-SUMMARY OF THE INVENTION 

In a preferred embodiment of the present invention, there are provided isolated 
nucleic acids (polynucleotides) which encode mature enzymes having the deduced amino, 
acid sequences of Figures 1-18 (SEQ ID NOS: 15-28 and 61-64). 

In another embodiment, the invention provides a method for producing a 
polypeptide including culturing host cells containing the polynucleotide of Figures 1-18 and 
expressing from the host cell a polypeptide encoded by the polynucleotide and isolating the 
polypeptide. " " " 

In another embodiment, the invention provides an enzyme selected from the group 
consisting of an enzyme having an amino acid sequence set forth in SEQ ID NOS: 15-28 
or 61-64 and an enzyme which has at least 30 consecutive amino acid residue as an enzyme 
having an amino acid sequence set forth in SEQ ID NOS: 15-28 or 61-64. 

In yet another embodiment, the invention provides a method for generating glucose 
from soluble cell oligosaccharides which includes contacting a sample containing 
oligosaccharides with an effective amount of an enzyme selected from the group of 

5 
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enzymes having the amino acid sequence set forth in SEQ ID NOS: 15-28, 61-63 and 64 
such that glucose is produced ~ ' ■ ,' 

The publications, discussed herein are provided solely for their disclosure prior to 
the filing, date of, the present application. . Nothing herein is to be construed as an 
admission that the invention, is not -entitled to antedate such disclosure by virtue of prior 
invention. 

Definitions . — ; 

. "Monosaccharide", as used herein, refers to a single polyhydroxy aldehyde or 
ketone unit. 

"Oligosaccharide",^ used herein, .consist of short chains of monosaccharide units 
joined together by covalent bonds. Of these, the most abundant are the disaccharides, 
which 1 have. two monosaccharide units. - ■ ' - 

"Polysaccharide", as used herein, consists: of long chains, having; many 
monosaccharide units. 1 * . 

. The term' "gene" .means, the segment of DNA involved in producing/a polypeptide 
chain; it. includesTegiohs preceding and followingthe: coding region,(leader and trailer) as : 
well as intervening sequences (introns) between individual coding segments (exons). 

A coding sequence is "operably: linked. to" another, coding sequence when RNA 
polymerase will transcribe the two coding sequences into a single mRNA, which is then 
translated into a single polypeptide having amino acids, derived from, both coding . 
sequences. The coding sequences need not- be contiguous to one another so long as, the 
expressed sequences ultimately process to produce ,the desired protein. 

"Recombinant" enzymes refer to enzymes produced by recombinant ; DNA 
techniques; i.e., produced from cells transformed by an exogenous DNA construct encoding 
the desired enzyme. "Synthetic" enzymes are those prepared by chemical synthesis.. * 

A DNA "coding sequence of or a "nucleotide sequence encoding" a particular 
enzyme, is a DNA sequence which is transcribed' and translated into an enzyme when 
placed under the control of appropriate regulatory sequences. 
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Detailed Description of the Invention 

The polynucleotides and polypeptides of the present invention have been identified 
as glucosidases, a-galactosidases. P-galactosidases, B-mannosidases, B-mannanases, 
endoglucanases, and puilalanases as a result of their enzymatic l activity. 

In accordance with one aspect of the present invention, there are provided-novel 
enzymes, as. well as active fragments, analogs and derivatives thereof. 

In accordance with another aspect of the present invention,. there are provided 
isolated nucleic, acid molecules encoding the enzymes of the present invention including 
mRNAs, cDNAs, genomic DNAs as well as active analogs and fragments of such enzymes. 

In accordance with yet a further aspect of the present invention, there is provided 
a process for producing such polypeptides by recombinant techniques comprising culturing 
recombinant prokaryotic and/or euk.aryotic "host cells, containing a nucleic acid sequence 
of the present invention, under conditions promoting expression of said enzymes and 
subsequent recovery of said enzymes. ^ 

In accordance with yet,a further aspect of the present invention, there is provided' 
: a process for utilizing such enzymes, or polynucleotides encoding such enzymes for 
hydrolyzing lactose to galactose and glucose for use in the food processing industry, the 
pharmaceutical industry, for example, to treat intolerance to lactose, as a, diagnostic reporter 
molecule, in com wet milling, in the fruit juice industry, in baking, in the textile industry 
and in the detergent industry. • - T . 

. In accordance with yet a further aspect of the present invention, there is provided 
a process for utilizing such enzymes, for hydrolyzing guar gum. (a . .galactomannan. 
polysaccharide) to remove non-reducing terminal mannose residues. Further 
polysaccharides such as galactomannan and the enzymes according -o the invention that 
degrade them have a variety of applications. Guar gum is commonly used as a thickening 
agent in food and also is utilized in hydraulic fracturing in oil and gas recovery. 
Consequently, mannanases are industrially relevant for the degradation and modification 
of guar gums. Furthermore, a need exists for thermostable mannases that are active in 
extreme conditions associated with drilling and well stimulation. 
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In accordance with yet a further aspect' of- the. present invention, there are also ■ 
provided nucleic acid probes comprising nucleic acid molecules of sufficient length to 
specifically hybridize to a nucleic acid' sequence of the present invention. • 

In accordance With yet a furthenaspect of the preserit invention, there is provided 
a process for. utilizing such enzymes, or polynucleotides encoding such enzymes, . for in 
vitro purposes related to scientific research, for example, . to generate probes for . identifying 
similar sequences Which might encode similar enzymes from other organisms by using 
^certain : fe^ons;j.e.";.-<f6nseived Seqiieiwe tegionsrqflhe nucleotide-sequence.-:-- 1 - - ■.. — : 

These and other aspects of the present invention' should be apparent to those skilled 
in the art from the teachings herein. 

. The polynucleotides of this invention we're originally recovered from genomic gene 
libraries derived from the following organisms: - .. ' ■ ' , ■ 

M11TL is a new species of Desuifurococcus isolated "from. Diamond' Pool in 
Yellowstone National Park. The organism grows optimally at 85-88 c C. pH 7.0 in a low salt " 
■ medium containing yeast extract, peptone,' arid gelatin as substrates with a N : /CO : gas . 

' phase.- .. • '. ■ • ' 

OG1/4V , is from the genus Thermotoga. .The. organism was isolated, from, 

Yellowstone National Park,' It grows optimally at 75 °C in a low salt medium with cellulose 

as a substrate and.N-, in gas phase. : *'• 

" Pyrococcus fitriosus VCl and' (7EG1) is from the genus Pyococcus. ,'VC\ was 

.isolated from Vulcano, Italy. It grows optimally at 100°C in a high salt medium (marine) 
V containing elemental sulfur, : yeast extract, peptone and starch- as substrates and N, in gas 



Staphylothermus marinus Fl is a from the genus Staphyloderma : Fl was isolated 
from Vulcano, Italy; It grows optimally at85'C,'. pH 6.5 in high salt medium (marine) 
containing elemental sulfur and yeast extract as substrates and N, in gas phase. . 
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Thermococcus 9N-2 is from the genus Thermococcus 9N-2 was isolated from 
diffuse vent fluid in the East Pacific Rise. It is a strict anaerobe that grows optimally at 
87°C. 

Thermotoga maritime MSB8 and MSB8 (Clone # 6GP2 and 6GB4) is from the 
genus Thermotogo,m& was isolated from Vulcano, Italy. MSB8 grows optimally at 85 °C, 
pH 6 5 in a high salt medium (marine) containing starch and yeast extract as substrates and 
N 2 in gas phase. *' ' 

Thermococcus alcaliphilus AEDII12RA is from the genus Thermococcus/ 
AEDII12RA grows optimally at 85 °C, pH 9.5 in a high salt medium (marine) containing 
polysulfides and yeast extract as substrates and N 3 in gas phase. 

Thermococcus chitonophagus GC74 is from the genus Thermococcus. GC74 grows 
optimally at 85 °C, pH 6.0 in a high salt medium (marine) containing chitin, meat extract, 
elemental sulfur and yeast extract as substrates and N 2 in gas phase. AEPII la grows 
optimally at 85°C at pH 6.5 in marine medium. under anaerobic conditions. It has many 
substrates. Bankia gouldi is from the genus Bankia. 

Accordingly, the polynucleotides and enzymes encoded thereby are identified by .- 
the organism from which they were isolated, and are sometimes hereinafter referred to as. 
"Ml 1TL M (Figure 1 and SEQ ID NOS:l and .15), "OC1/4V-33B/G'' (Figure 2 and SEQ ID 
NOS:2 and 16), "F1-12G" (Figure 3 and SEQ ID NOS:3 and 17), M 9N2-31B/G" (Figure 4 
and SEQID NOS:4 and 18), "MSB8" (Figure 5 and SEQ ID NOS:5 and 19), ''AEDII12RA- 
18B/G' 1 (Figure 6 and SEQ ID NOS:6 and 20), M GC74-22G ,t (Figure 7 and SEQ ID NOS:7 
and 21), "■"■VC1-7G1" (Figure 8 and SEQ ID NOS:8 and 22), "37GF1" (Figure 9 and SEQ 
ID NOS: 9 and 23), M 6GC2 M (Figure lO andSEQ IDNOS: lO and 24), "6GP2" (Figure 11 
and SEQ ID NOS:ll and 25), "AEPII la" (Figure 12 and SEQ ID NOS:12 and 26), 
"OC1/4V" (Figure 13 and SEQ ID NOS:13 and 27), and "6GP3" (Figure 14 and SEQ ID 
NOS:28),."MSB8-6GP2" (Figure 15 and SEQ ID,NOS:57 and 6,1). H MSB8-6GB4"(Figure 
16 and SEQ ID NOS:58 and 62),"VCl-7EGl ,, (Figure 17 and SEQ ID NOS:59 and 63), 
and37GP4 (Figure 18 and SEQ ID NOS:60 and 64). 
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The polynucleotides and polypeptides of the present invention show identity at the 
nucleotide and protein 'level to known genes and proteins encoded thereby as shown in ' „ 
Table 1. , ' ■ ■■- : ' • ' ' ' - . ' ' : • ' " \ \ 

■".:)';- . ■ ," ■. ' , "■ . : Table 1 ; ; . ' \ " . 



Clone ; 


Gene/Protein with ,'. 
: Closest Homology ~ 


Protein 
' V Identity 


, Nucleic ' 
Acid 
• ..Identity- " 


M11TL-29G ■ 


. Sulfolobus suifataricus , - 
DS'M 16I6/P1, (J- 
salactosidase / ^ 




ceo/ 


OC1/4V-33B/G- 


Caldocellum 
saccharo lytic um,. p- 
glucosidase ' ; "• 


. ■ 52% 


■ 57% 


- Staphylpthermus 

rYlQrlYlUS r i - 1— O 


Bacillus pplymyxa, p- 

SaiaCLOSlUttbC 


' ; 36% 


" ,'48% 


Thermococcus 9N2- 

31B/G " 


Sulfolobus suifataricus 
! ATCC 49255/MT4 ; p- - / 
galactosidase 


• 51% 


. 50% ■ 


Thermotoga maritima 
MSB8-6G ' 


Clostridium thermocellum 
bglB * X- .' : 


45% 


' 53% 


Thermococcus 
. AEDII12RA-18B/G , . 


Bacillus polymyxa, p- 
galactosidase 


■ . 34% •'■ 


:• 48% ' 


Thermococcus v ;~. 
chitonophagus GC74- 
-22G . ;' 


Sulfolobus suifataricus: ' 
ATCC 49255/MT4, P : • 
galactosidase 


• 46% 


•■ 54% 
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Pyrococcus furiosus 
.VC1-7G1 . 


OUIIOIODUS 

su!fataricus/MT-4.P- ■ 

UdldLlUblUaifc 


'tO.H/O 


J— .J /o 


Thermotoga maritima 

cs-galactosidase 

(6GC2) 


Pediococcus pentosaceaus 
u-gaiaciosiaase 


49% 


29% 


Thermotoga maritima 
b-mannanase (ouri ) 


Aspergillus aculeatus 
mannanase 


56% 


37% 


AEPII la B- 

mannosidase C63GB 1 ) * 


Sulfolobus solfactaricus B- 
. galactosidase 


78% 


56% 


OC1/4V 

endoglucanase ' 
(33GP1) 


■Clostridium thermocellum 
endo- 1-,4-B-endoglucanase 


65% 


43% 


Thermotoga maritima 


. Caidocellum 

^ncrhnrol vti ci im ct~ 

JUv V i 1 Ha W 1 ¥ IiW Lil 11 Mr 

destrom 6 
glucanohydralase 


' 72 


53 


Bankia gouldi mix 

Endoglucanase 

(37GP1) 


None available 







The polynucleotides and enzymes of the present invention show homology to each 
, other as shown in Table 2.- 
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" Table 2 ' 



Clone . ■ 


* ■ Gene/Protein with 
. " Closest Homology - 


Protein - 
Identity' , 


. Nucleie \ 
Acid . 
Identity . 


Staphylothermus * 
marinas F1-12G 


Thermococcits 

aediii2ra-i8b/g, p- ' - 

■ galactosidase; glucosidase . 


" 55% "'; 


57% 


fhermococcus • 9N2- r 
! 31B/G : 1 * ■ 


Thermococcus v . 
chitonophagus-.GClA- ■ ; 
22G-glucosidase\ / v * / . 


' 74%; , . 


' .v 66% • • 


> Pyrococcus furiosiis 
■ VC1-7G1 1 , 


Pyrococcus furiosus VC1 - 
. 7B/G p-galactosidase 


< , 46.4% : 


\ 54% • 



;■. All' the* -clones identified in Tables' rand' 2 -encode polypeptides' which nave a- 
glycosidase or P-glycosidase activity'. ' - ' • . , , - * ' 

1 ■ ' This invention, in addition to the isolated nucleic acid molecules encoding; the \ 
enzymes of the present invention; also provide, substantially similar. sequences.' Isolated 
nucleic acid sequences are. substantially similar if: (i) they are capable of hybridising under 
conditions hereinafter described, to the polynucleotides of SEQ ID NOS: 1-14 and 57-60;. 

. (ii)orthey encode DN A sequenc es which are degenerate to the po lynucleotides of SEQ ID*. 
NOS: 1-1 4 and 57-60. Degenerate DNA sequences encode the amino. acid sequences of 
SEQ ID NOS: 15-28 and 61-64, but have variations in the nucleotide coding sequences. As 
used herein, substantially similar refers to the sequences having similar identity to the 
sequences of the instant invention. . The nucleotide sequences that are substantially the same " 
can be identified by hybridization or by sequence comparison;- Enzyme' sequences that are 
substantially .the same can be -identified by one or more -of the following:- proteolytic 
digestion, gel electrophoresis and/or microsequencing. . 

One means for isolating the nucleic acid molecules encoding the enzymes of the 
present invention is to probe a gene library with a natural or artificially designed probe 
using art recognized procedures (see/for example: Current Protocols in Molecular Biology, 
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Ausubel F.M; et al .(EDS.) Green Publishing Company Assoc. and John Wiley Interscience; 
New York, 1989, 1992). It is appreciated to one skilled in the art that the polynucleotides 

. of SEQ ID NOS: 1-14 and 57-60 or fragments thereof (cpmprising.at least 12. contiguous 
nucleotides), are particularly useful probes. Other particular useful probes, for this purpose 
are hybridizable fragments to the sequences of .SEQ' ID NOS: 1-14 and 57-66 (i.e., 
comprising at least 1 2 contiguous nucleotides). ' 

With respect to nucleic acid sequences which hybridize to specific. nucleic acid 
sequences disclosed herein, hybridization may be carried out under conditions of reduced 
stringency, medium stringency or even* stringent, conditions. As an; example of 
oligonucleotide hybridization, a 'polymer membrane containing immobilized denatured 
nucleic acids is fust prehybridized for 30 minutes at 45 °C in a solution consisting of 0.9 M 
NaCL 50 mM NaH : P0 4 , pH 7.0. 5.0 mM N&EDTA, 0.5% SDS, 10X Denhardt's, and 0.5 
mg/ml polyriboadenylic acid. Approximately 2 X 10 7 cpm (specific activity 4-9 X ill 
cpm/ug) of 32 P end-labeled oligonucleotide probe are then added to the solution.. After 12- 
16 hours of incubation, the membrane is washed for 30 minutes at room temperature in I X 
SET (1 50 mM NaCl, 20 mM Tris hydrochloride, pH 7.8,1 mM Na : EDTA) containing 0.5% 

> SDS, followed by a 30 minute wash in fresh 1 X SET at Tm 10°C for the oligonucleotide 
probe. The membrane is then exposed to autoradiographic film for detection of 
hybridization signals. • ' 

Stringent conditions means hybridization will occur only if there is at least 90% 
identity, preferably at least 95%,identity and most preferably at least 97% identity between 
the sequences. Further, it is understood that a section of a 100 bps sequence that is 95 bps 
in length has 95% identity with the 1090 bps sequence from which it is obtained. See J. 
Sambrook et al. Molecular Cloning, A Laboratory Manual, 2d Ed, Cold Spring Harbor 
Laboratory (1989) which is hereby incorporated by reference in its entirety. Also, it is 
understood that a fragment of a 100 bps sequence that is 95 bps in length has 95% identity 

'with the 100 bps sequence from which it is obtained,, ' , 

As used herein, a first DNA (RNA) sequence is at least 70% and preferably at least 
80% identical to another DNA (RNA) sequence if there is at least 70% and preferably at 
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least a 80% or 90% identity,, respectively,. between the bases of the' first 'jequence and the 
. bases of the another- sequence, when properly aligned, with each other, for example when 

aligned by BLASTN. . - : — ' V,' , \ / . / 

1 "Identity" as the term is used herein,. refers to a polynucleotide sequence which, 
comprises a percentage of the same bases as a-reference polynucleotide (SEQ ID NOS:i-14 ' 
and' 57-60).. For example, a polynucleotide .which is at least 90% identical to a reference 
polynucleotide, has polynucleotide bases, which are .identical' in 90% of the 1 bases which . 
make up .the 'reference polynucleotide and may have different bases in.. 10% of the bases ' 
..which comprise that polynucleotide sequence. t 

1 / 1 The present invention relates: polynucleotides Which differ from the reference 
. polynucleotide such that the changes are silent changes, for example the change do not alter ; 
• the amino acid sequence encoded by the polynucleotide. The present invention also relates 
to nucleotide changes which result in amino acid substitutions, additions, deletions, fusions 
ani* truncations- in the polypeptide encoded by. the reference polynucleotide, in a preferred 
aspect of the invention these polypeptides retain the same biological action as the 
polypeptide encoded by the reference polynucleotide. 

■ It is also appreciated that such probes can be and are preferably labeled with an 
analytically detectable reagent 'to facilitate identification of. tile .probe.., .Useful reagents 
include but are not limited to radioactivity, fluorescent dyes or enzymes capable* of 
catalyzing the' formation of a detectable product. The probes are; thus useM to isolate 
compl'ementary.copies of DNA from other sources or to screen such sources for related 

sequences. ' f ' ■■ * «■* { : . 

■ The polynucleotides of this, invention were recovered from genomic gene libraries 
from the organisms listed in Table 1. For. example, gene- libraries can be generated in the 
. Lambda ZAP ' II cloning vector (Stratagene Gloning'.Systems).. . Mass excisions -can be 
performed on these libraries to generate libraries in the pBluescript phagemid. Libraries 
.are thus generated and excisions performed according to the protocols/methods hereinafter 
described. 
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The excision libraries are introduced" into the £ coli strain BW 14893 F'kanlA. 
Expression clones are then identified using a high temperature filter assay. Expression 
clones encoding several glucanases and several other glycosidases are identified and 
repurified. The polynucleotides, and enzymes encoded thereby, of the present invention 
yield the activities as described above. 

The coding sequences for the enzymes of the present invention were identified by- 
screening the genomic DNAs prepared for the clones having glucosidase or galactosidase 
activity. 

An example of such an assay is a high temperature filter assay wherein expression 
clones were identified by use of high temperature filter assays using buffer Z (see recipe 
below) containing 1 mg/ml of the substrate 5-bromo-4-chIoro-3-indolyl-P-D- 
glucopyranoside (XGLU) (Diagnostic Chemicals Limited or Sigma) after introducing an 
excision library into the £ coli strain BW14893 F'kanlA. Expression clones encoding 
XGLUases were identified and repurified from Ml 1TL. OC1/4V, Pyrbcoccus furiosus VC1, 
Staphylothemus marinus Fl, Thermoco.ccus 9N-2 ? Thermotoga maritima MSB8, 
Thermococcus alcaliphilus AEDII12RA. and Thermococcus chitonophagus GC74. 

Z-buffer: '(referenced in Miller, J.H. (1*992) A Short Course in Bacterial Genetics, * 

p. 445.) 

' per liter: 

Na : HPO,-7H : 0 ' 16.1g 

NaH;PO,-7H : 0 5.5g 

KC1 1 ' - * 0.75g - - - - • ■ 

MgS0 4 -7H 2 0 0.246g' 1 ' 

P-mercaptoethanol 2.7m! 
Adjust pH to 7.0 

High Temperature Filter Assay 
(1 ) The f factor fkan (from £. coli strain CSH 1 1 8)(1 ) was introduced into the pho-pnh- 
lac-strain BW1 4893(2). BW1 3893(2). The filamentous phage library was plated 
oh the resulting strain, BW14893 F'kan. (Miller; J.H. (1992) A Short" Course in 
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x Bacterial Genetics; Lee, K.S., Metcalf, etal.,( 1992) Evidence for two phosphonate ; 

. degradative pathwaysin Enterobacter Aerogenes, J. Bacteriol., 174:2501-2510. 

. (2) d After growth on 100 mm LB plates containing 100 |ig/mL ampiciliin. 80 \xg/ml 

" . nethicillin and ImM IPTG, colony "lifts were performed using Millipore HATF 

5 membrane filters. 

. . ' (3), The colonies transferred lo the, filters,wer£ lysed with chloroform vapor; inT 50 mm 

> / ' glass petri dishes. 

. : - . .. . (4) ..The filters were .transferred to 100 mm glass petri dishes containing a piece 1 of , 

■ Whatman 3 MM filter paper saturated with buffer.. •. 

io ' ( a ) when testing for galactosidase activity (XGALase), 3MM paper was 

'saturated with Z buffer containing 1 mg/ml XGAL (ChemBridge 
Corporation). After transferring niter bearing lysed colonies to the glass 
petri dish, placed dish in oven at 80-85 3 C. 
. (b) when testing for glucosidase (XGLUase), 3 MM paper was saturated 

15 . .; ; . with Z Buffer containing 1 mg/ml XGLU. After transferring filter:, bearing 

;;' ' : lysed colonies to the glass petri dish, placed dish in oven, at 80-85 °C: 

v " .(5) ' 'Positives' were observed as blue 'spots on the filter membranes. Used the following 
filter fescue technique to retrieve' plasmid from lysed positive ; colony. Used pasteur 
pipette (or glass capillary tube) to core blue spots on the filter membrane. Placed- 
20" .. ' the small filter disk in an Eppendorf tube containing 20^1 water. Incubated the 

■ Eppendorf tube at 75°C for 5 minutes followed by vortexing to elute plasmid DNA 
;1 J off filter. This DNA was transformed into electrocompetent.£ co/i cells DH10B 

for. Thermatoga maritima MSB8-6G, Staphylothermus marinus F1-12G, 
Thermococcus AEDII12RA-18B/G,- Thermococcus chitonophagus "GG74-22G, 
25 • M11T1 andOCl/4V. Electrocompetent'BW 14893 F'kanl A £.co// were used for 

Thermococcus 9N2-3 1 B/G, and Pyrococcus furiosus VC 1 -7G 1 . Repeated filter-lift 
assay; on transformation plates to identify 'positives'. Return transformation plates 
to 37°C incubator after filter lift to regenerate colonies. Inoculate 3 ml LB liquid 
* containing 100 |ig/ml ampiciliin with repurified positives and incubate at 37°G 
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' overnight. Isolate plasmid DNA from these cultures and sequence plasmid insert. 
In some instances, where the*plates used for the initial colony lifts -contained non- 
confluent colonies, a specific colony corresponding to a blue spot on the filter could 
be identified on a regenerated plate and repurified directly, instead of using the filter 
rescue' technique. ■ ■ . \ ' 

Another example of such an assay is a variation of the high temperature filter assay 
wherein colony-laden filters are heat-killed at different temperatures (for example, 105°C 
for 20 minutes) to monitor thermostability, The 3 MM paper is saturated with different 
buffers (i.e.; 100 mM NaCl, 5 mM MgCI 2 , 100 mM Tris-Cl (pH 9.5)) to determine enzyme 
activity under different buffer conditions. 

A p-glucosidase assay may also be employed, wherein GlcpPNp is used as an 
artificial substrate (aryl-P-glucosidase). The increase in absorbance at 405 nm as a result 
of p-nitropheno'l (pNp) liberation was followed on a Hitachi U-l 100 spectrophotometer, 
equipped with a thermostatted cuvette holder. The assays may be performed at 80°C or 
90°C in. closed 1-ml quartz cuvette. A standard reaction mixture contains 150 mM 
trisodium substrate, pH 5.0 (at 80 °C), and 0.95 mM pNp derivative pNp = 0.561 mM* 1 cm" 
l ). The reaction mixture is allowed to reach the desired temperature, after which the 
reaction is started by injecting an.appropriate amount of enzyme (1.06 ml final volume). 

- 1 U P-glucosidase activity is defined as that amount required, to catalyze the 
formation of 1.0 um'ol pNp/min. D-cellobiose may also be used as a substrate. 

An ONPG assay for p-galactosidase activity is described by Miller, J.H. (1992) A 
Short Course in Bacterial Genetics and Mill, J.H. (1992) Experiments in- Molecular 
Genetics, the contents of which are hereby incorporated by reference in their entirety. 

A quantitative fluorometric assay for p-galactosidase specific activity is described 
by : Youngman P., (1987) Plasmid. Vectors for Recovering and Exploiting'- Tn9 17 
Transpositions in Bacillus and other Gram-Positive Bacteria. In Plasmids: A Practical 
' approach (ed. K. Hardy) pp 79-103. IRL Press, Oxford; A description of the procedure can 
be found in Miller (1992) p. 75-77, the contents of which are incorporated by reference 
herein in their entirety. 
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The polynucleotides of the present invention may. be. in the form, of DNA which 
DNA includes cDNA, genomic DNA, and synthetic DNA. The. DNA may be double- 
::i -stranded or single-stranded, and if single stranded may be the coding strand or non-coding 
' (anti-sense) strand. The coding sequences' which. ; encodes the mature, enzymes may be 
identical to the coding sequences shown in Figures' 1-8 '(SEQ ID NOS: .1-14 and 57-60) or 
may be a- different codingisequence which coding sequence, as a result of the redundancy 
or degeneracy of Uhe genetic code, encodes the same mature enzymes as the DNA of 
: Figures 1-1 8-(SEQ ID NOS: 1-14 ando7-60). • . ? • . 

The polynucleotide which encodes for the mature enzyme of Figures 1 -18 (SEQ ID 
NOS: 15-28 and'61-64) may include; but is not limited to: only the coding sequence for the 
mature enzyme; the coding sequence for the mature enzyme and additional coding sequence • 
such as a leader sequence or a proprotein sequence; the coding sequence for the mature 
enzyme (and optionally additional coding sequence) and non-coding sequence, such as 
- introns or non-coding sequence 5' and/or 3' of the coding sequence for the mature enzyme. 

Thus., the term "polynucleotide Encoding an enzyme (protein)"- encompasses a 
.polynucleotide -which includes '6m j coding sequence for the enzyme as well as a 
.■polynucleotide which includes additional coding and/or non-coding seque;nce. 

The present invention further' relates to variants of the hereinabove described 
polynucleotides which encode for fragments, analogs and derivatives o f the enzymes having 
the deduced amino acid sequences of Figures 1-18 (SEQ ID NOS: 15-28 and ,6 1-64). The 
variant of the polynucleotide may be a naturally occurring allelic variant of the 
polynucleotide or anon-naturally occurring, variant of the polynucleotide. ' ■ 

Thus, the present invention includes polynucleotides encoding the same, mature, 
enzymes as shown in Figures 1-18 (SEQTD NOS: 1.5-28 and 61-64) as well as variants.of 
such polynucleotides which variants' encode for a fragment, derivative or analog of the 
enzymes of Figures 1-18 (SEQ ID NOS : 15-28. and , 6 1-64). Such nucleotide .variants 
include deletion variants, substitution variants and addition or. insertion variants. 

As hereinabove indicated, the polynucleotides may have a coding sequence which 
is a naturally occurring allelic variant of the coding sequences shown in Figures 1-18 (SEQ 
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ID NOS: 1-14 and 57-60). • As known in the art, an allelic variant is an alternate form of. a. 
polynucleotide sequence which may have a substitution, deletion or addition of one or more 
nucleotides, which does not substantially alter the function of the encoded enzyme. 

- Fragments of the -full length gene of the present . invention may be used as a 
hybridization-probe for a cDNA or a genomic library to isolate the full length DNA and to 
isolate other DNAs which have a high sequence similarity to the gene or similar biological 
activity. Probes of this type preferably have at least 10, preferably at least 15,, and even 
more preferably at least 30 bases and may contain, for example, at least 50 or more bases. 
The probe may also be used to identify a DNA clone corresponding to a full length 
transcript and a genomic clone or clones that' contain the complete, gene including 
regulatory and promotbr regions, exons, and introns,, An example of a screen comprises 
isolating the coding region of the gene by using the known DNA sequence to synthesize an 
oligonucleotide probe. Labeled oligonucleotides having a sequence complementary to that 
of the gene of the present invention are used to screen a library of genomic DNA to 
determine which members of the library the probe hybridizes to. 

The present invention further relates to polynucleotides which hybridize to the 
hereinabove-described sequences if there is at least 70%, preferably at least 90%, and more 
preferably at least 95% identity between the sequences. The present invention particularly 
relates to polynucleotides which hybridize under stringent conditions to the hereinabove- 
described polynucleotides. - As herein used, t^e term "stringent conditions" means 
hybridization will occur only if there is at least 95% and preferably at least 97% identity 
between the sequences,. The polynucleotides which hybridize to the hereinabove described ■ 
polynucleotides in a preferred embodiment encode enzymes which either retain 
substantially the same biological function or activity as the mature enzyme encoded by the 
DNA of Figures 1-18 (SEQ ID NOS: 1-14 and : 57-60). . 

Alternatively, the polynucleotide may have at least 15 bases v preferably at least 30 
bases, and more preferably at least 50 bases which hybridize to any part of a polynucleotide 
of the present invention and which has an identity thereto, as hereinabove described, and 
which may or may not retain activity. For example, such polynucleotides may be employed 
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as- probes for the polynucleotides of . SEQ ID NOS: 1-14 and 57-60,, for example, for 
recovery of the polynucleotide or as a diagnostic probe or as a PCR primer. ; - « 

• Thus, the present invention is directed to polynucleotides, having at least a 70% 
identity, preferably at least 90% identity and more preferably at least a 95% identity to a 
polynucleotide which' encodes the. enzymes of SEQ ID NOS: .15-28 and 61-64 as well- as 
fragments thereof, which fragments have -at least 15 bases, preferably at' least 30 bases and 
most preferably at least 50 bases, which fragments are at'least 90% identical,, preferably at 
least 95% identical and mostpreferably.at least 97%. identical under stringent conditions 
to any portion of a polynucleotide of the present invention. . - • 

The present invention. further relates to enzymes which have the deduced amino acid * 
sequences 'of Figures 1-18 (SEQ ID NOS: 15-28 and 61 : 64) as well as fragments, analogs ' 
and derivatives of such enzyme. v , t , ■ 

■ .. " ' The terms "fragment," "derivative" and "analog" when refemng to the enzymes of 
Figures 1-18 (SEQ tt>NOS: 15-28 and 61-64) means enzymes which retain essentially the 
same biological function or activity as such enzymes. Thus, an analog includes a proprotein 
which can be activated by cleavage of the proprotein portion to produce an active mature 

enzyme. ' r ' : ■ ' ..;*"\ ■ . . ■■ 

The enzymes of the present invention may be-a recombinant enzyme, a natural 
enzyme or a synthetic, enzyme, preferably a recombinant enzyme. 

V The fragment, derivative or analog of the enzymes of Figures 1-1 8 (SEQ ID NOS: 
• 15-28 and' 6 1-64) may be (r) one in which one or more of the amino acid residues are- 
substituted with a conserved'or non-conserved amino, acid residue (preferably a consented 
amino acid. residue) and such substituted amino acid residue may or may riot be one 
encoded by the genetic code, or (ii) one in which one or more of the amino acid residues 
includes a substituent group, or (iii) one .in which the mature enzyme is fused .with-another 
compound, such as a compound to increase the half-life of the enzyme (for example, 
polyethylene glycol), or (iv) one in which the additional amino acids are fused to the mature 
enzyme, such as a leader or secretory sequence or a sequence which is employed for 
purification of the mature enzyme or a proprotein sequence. Such fragments, derivatives 
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and analogs are deemed to be within the scope of those skilled in the art from the teachings 
herein. 

The enzymes and polynucleotides of the present invention are preferably provided 
in an isolated form, and preferably are purified to homogeneity. * 

5 ' * The term "isolated" means that the material- is removed from its original 

environment (e.g., the natural environment if it is naturally occurring). For example, a 
naturally-occurring polynucleotide or enzyme present in a living animal is not isolated, but ' 
the same polynucleotide or enzyme, separated from some or all of the coexisting materials 
in the natural system, is isolated. Such polynucleotides could be part of a vector and/or 

10 such polynucleotides or enzymes could be part of a composition, and still be isolated in that 

such vector or composition is not part of its natural environment. 

The enzymes of the present invention include the enzymes of SEQ ID NOS: 15-28 
and 61-64 (in particular the mature enzyme) as well as enzymes which have at least 70% 
similarity (preferably at least 70% identity) to the enzymes of SEQ ID NOS: 15-28 and 61- 

l5 . 64 and more preferably at least 90% similarity (more preferably at least 90% identity) to 

the enzymes of SEQ ID NOS: 15-28 'and 61-64 and still more preferably at least 95% 
similarity (still more preferably at least 95% identity') to the enzymes of SEQ ID NOS: 15- 
'28 and 61-64 and also include portions of such enzymes with such portion of the enzyme 
generally containing at least 30 amino acids and more preferably at least 50 amino acids. 

20 As known in the art "similarity" between two enzymes is determined by comparing 

the amino acid sequence and its conserved amino acid substitutes of one enzyme to the 
sequence of a second enzyme. — - ... . . 

A variant, i.e.. a "fragment", "analog" or "derivative" polypeptide., and reference 
polypeptide may- differ in amino acid sequence by one or more substitutions, additions, 

25 deletions, fusions and truncations, which may be present in any combination. 

Among preferred variants are those that vary from a reference by conservative 
amino acid substitutions. Such substitutions are those that substitute a given amino acid in 
a polypeptide by another amino acid of like characteristics. Typically seen as conservative 
substitutions are the replacements, one for another, among the aliphatic amino acids Ala, 
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Val", Leu and He; interchange of the hydroxyl residues Ser and Thr, exchange of the acidic 
residues Asp and Glu, substitution between the amide residues Asn and Gin, exchange of 
the basic residues Lys and Arg and replacements among the aromatic residues Phe, Tyr.' 

Most highly preferred are variants which retain the same. biological function and 
activity as the reference polypeptide from which it varies. 

Fragments or portions of the enzymes of thepresent invention may be .employed for 
producing the corresponding full-length enzyme by peptide" synthesis:, therefore, the 
fragments may be employed as intermediates for producing the full-length enzymes. 
Fragments; or portions of the polynucleotides of .the present invention' may be .used, to 
synthesize full-length polynucleotides of the present invention. , 

The present invention also relates to vectors which include polynucleotides of the 
present invention, host cells which are genetically 'engineered with vectors of the invention 
•and the production of enzymes of the invention by recombinant techniques. 

Host cells.are genetically engineered (transduced^or transformed or transfected) witlx: 
the vectors; of this invention which may be, for example, a cloning vector or an expression - 
vector. The vector may be, for example^ in the form of a plasmid.-a viral particle, a phage; 
etc. The engineered host cells can be cultured in conventional nutrient media modified as 
appropriate foractivating, promoters, '.selecting transforaiants or amplifying the-genes-of the 
present invention: The cuiture conditions, such.-as temperature, pH and the like; are those . 
previously used with the host cell selected- for expression,: and' will be apparent to. the 1 ' 
ordinarily skilled artisan. 

- The polynucleotides of the present invention" may be employed for producing 
.enzymes by recombinant techniques. Thus, for example, the polynucleotide may' be 
.included in any one of a variety of expression vectors for expressing an enzyme. Such 
■vectors' include chromosomal, nonchromosomal = and synthetic DNA sequences, e.g., 
derivatives of SV40; bacterial plasmids; phage DNA; baculovirus; yeast plasmids; vectors 
derived from combinations of plasmids and phage DNA, viral DNA such as vaccinia, 
adenovirus, fowl pox virus, and pseudorabies. However, any other vector may be used as 
long as it is replicable and viable in the host. 
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The appropriate DNA sequence -may be inserted into the vector by a variety, of . 
procedures. In general, the DNA sequence is inserted into an appropriate restriction 
endonuclease site(s) by procedures known in the-art. Such procedures and others are 
deemed to be within the scope of those skilled in the art. 

The DNA sequence in the expression vector is operatively linked to an appropriate 
expression control sequence(s) (promoter) to direct mRNA synthesis. As representative 
examples of such promoters, there may be mentioned: LTR or SV40 promoter, the E. coli, 
lac or trp, the phage lambda P L promoter and other promoters known to control expression 
of genes in prokaryotic or eukaryotic cells or their viruses. The expression vector also 
contains a ribosome 'binding site for translation initiation and a transcription terminator. 
The vector may also include appropriate sequences for amplifying expression. 

In addition, the expression vectors preferably contain one or more selectable marker 
genes to' provide a phenocypic trait for selection of transformed host ceils such as 
dihydrofolate reductase or neomycin resistance for eukaryotic cell culture, or such; as 
tetracycline or ampiciliin resistance in EjcoH. , 

The vector containing the appropriate DNA sequence as hereinabove described^; as 
well as an appropriate promoter or control sequence, may be employed to transform an 
appropriate host to permit the host to express the protein. • - %. 

As representative examples of appropriate hosts, there may be mentioned:, bacterial 
cells, such as E. coli , Streptomvces . Bacillus - subtilis ; fungal cells, such as. yeast; insect .cells 
such as Drosophila S2-and Soodoptera Sf9 : animal cells such as CHO, COS or Bowes 
melanoma; adenoviruses; plant cells, etc. The selection of an appropriate host is deemed 
to be within the scope of those skilled in the art from the teachings herein. 

More particularly, the -present invention*-also includes recombinant constructs 
comprising one or more of the sequences as broadly described above. The constructs 
comprise a vector, such as a plasmid or viral vector, into which a 'sequence of the invention 
has been inserted, in a forward or reverse orientation. In a preferred aspect , of this 
embodiment, the construct fiirther comprises regulatory sequences, including, for example, 
a promoter, operably linked to the sequence. Large numbers of suitable vectors and 
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promoters are known to -those, of skill in- the art: and , are, commercially .available. The 
following vectors are provided by way of example;. Bacterial: pQE70, pQE60, pQE-9 
(Qiagen). pDlO. psiX174, pBluescript.il KS, pNH8A ? pNH16a v pNH18A,' pNH46A 
(Stratagene); ptrc99a, pKK223-3, pKK233-3, pDR540, pRIT5 (Pharmacia); Eukaryotic: 
pSV2CAT, pOG44, pXTl, pSG (Stratagene) pSVK3, pBPV/pMSG, pSVL (Pharmacia). 
However, any other plasmid or vector may be used as long .aY they are replicable and viable * 
in the host. - . ' 

Promoter regions can be selected from any ' desired gene . using CAT 
'(chloramphenicol transferase) vectors or other -vectors with selectable markers. . t Two- 
appropriate vectors are pKK232'-8 .and pCM7, Particular named bacterial' promoters include 
lacl lacZ, T3, T7, gpt, lambda P R , P L and tip. Eukaryotic promoters include CMV 
immediate early, HSV thymidine kinase-;' early and late,SV40 ? LTRs fronv retrovirus, and 
mouse metallothiohein-I. Selection of the appropriate, vector and promoter is well within 
the level of ordinan' skill, in the art. \ . . . ' 

In a further embodiment, the' present invention relates to host cells containing the 
above-described constructs. The host cell can be a ; higher eukaryotic cell such as a 
/mammalian cell, or a. lower eukaryotic ceil, such- -as: a yeast cell, or the host cell can be a 1 
prokaryotic cell, such as a bacterial cell. Introduction of the construct into the. host cell' can 
be effected by calcium phosphate transfection. DEAE-Dextran mediated .transfection, or' 
electroporation (Davis, L, Dibnen M., Battey, . 'L v Basic Methods in Molecular Biology, \ 
(1986)). . V v . ■ '• 

The constructs in host . cells can be used in a conventional manner .to" produce the 
gene product encoded by, the recombinant sequence. Alternatively, the enzymes of the 
invention can be synthetically produced- by conventional peptide synthesizers. 
> - Mature proteins can be expressed in mammalian cells, yeast, bacteria, or-other cells 
under -the control of appropriate promoters.- Cell-free translation systems can also be 
employed to produce such proteins using RNAs derived from the DNA constructs of the 
* present invention. Appropriate cloning and expression vectors for use with prokaryotic and' 
eukaryotic hosts are described by Sambrook-, et al., Molecular Cloning: A Laboratory 
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Manual. Second Edition, Cold Spring Harbor, N.Y., (1989), the, disc jsure of which is 
hereby incorporated by reference. 

* Transcription of the DNA encoding the enzymes of the present invention by higher 
eukaryotes is increased by inserting an enhancer sequence into the vector. Enhancers are 
cis-acting elements of DNA, usually about from 10 to 300 bp that act on. a promoter to 
increase its transcription. Examples include the SV40 enhancer on the late side of the 
replication origin bp 100 to 270, a cytomegalovirus early promoter enhancer, the polyoma 
enhancer on the late side of the replication origin, and adenovirus enhancers. 

Generally, recombinant expression vectors will include origins jf replication and 
selectable markers permitting transformation of the host cell, e.g., the ampicillin resistance 
gene of E. coli and S. cerevisiae TRP1 gene, and a promoter derived from a highly- 

. expressed gene to direct transcription of a downstream structural sequence. Such promoters . 
can be derived from operons encoding glycolytic enzymes such as 3-phosphoglycerate 
kinase (PGK), a-factor, acid phosphatase, or heat shock proteins, among others. The 
heterologous structural sequence is assembled in appropriate phase with .translation 
initiation and termination sequences, and preferably, a leader sequence capable of directing 
secretion of translated enzyme. Optionally, the- heterologous sequence can encode a fusion 
enzyme including an N-terminal identification peptide imputing desired characteristics, 
e.g., stabilization or simplified purification of expressed recombinant product. 

Useful expression vectors for bacterial use are constructed by inserting a structural 

- DNA sequence encoding a desired protein together with suitable translation initiation and 
termination signals in operable reading. phase with a functional promoter. The vector will 
comprise one or more phenotypic selectable markers and an origin of replication to ensure 
maintenance of the vector and to, if desirable, provide amplification within the host. 
Suitable prokaryotic hosts for transformation include E. coli . Bacillus subtilis . Salmonella 
tvphimurium and various species, within the genera Pseudomonas, Streptomyces, and 
Staphylococcus, although others may also be employed as a matter of choice. 

As a representative but nonlimiting example, useful expression vectors for bacterial 
use can comprise a selectable marker and bacterial origin of replication derived from 
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' commercially available plasmids comprising-genetic elements of the well known cloning 
vector pBR322 (ATCC 37017). Such commercial vectors include, for example. pKK223-3 
' (Pharmacia Fine Chemicals, Uppsala, Sweden) and GEM 1 (Promega Biotec, Madison, WI, 
USA). These pBR322 "backbone" sections-are combined with an appropriate promoter and 
the structural sequence to be expressed. . > ... 

Following transformation of a suitable host strain and growth of the host strain to 
an appropriate cell density, the selected; promoter is. induced by, appropriate means (e.g., 
temperature shift or chemical induction) and cells are cultured for an additional period. ■ 

Cells : are typically harvested by centrifugation, disrupted by physical or chemical* 
means, and the resulting crude extract retained for further purification. ' 

Microbial cells employed in expression of proteins can- be-. disrupted by any 
convenient'method. including freeze-thaw cycling; sonication, mechanical disruption, or 
use of cell lysing agents, such methods are well known to those skilled in the art. ' . 

Various mammalian cell culture systems can also be - employed to . express 
recombinant protein. Exampies-of mammalian expression systems, include the COS-7 lines 
: of monkey kidney fibroblasts, described by Gluzman,. Cell, 23:175 (1981), and other cell 
■lines capable of expressing 'a compatible vector, for example, the C127, 3T3, CHQ, HeLa 
and BHK cell lines. Mammalian expression vectors will comprise an origin of replication, 
a suitable promoter and enhancer,' and also any necessary ribosome binding: sites', 
polyadenylatipn site, splice donor and acceptor sites, transcriptional termination sequences, 
and 5' flanking nontranscribed'sequences. DNA sequences derived .from the SV40 splice, 
and polyadenylation ■ sites may be used to provide the required nontranscribed genetic 
elements. - .• . • . ' • , • ' \ * 

The enzyme can be recovered and purified from recombinant cell cultures by 
methods including ammonium sulfate or ethanol precipitation,: acid extraction, anion or 
ckion exchange chromato^aphy, phosphocellulose chromatography, hydrophobic 
■ interaction chromatography, affinity chromatography, hydroxylapatite chromatography and 
lectin chromatography. Protein refolding steps can be used, as necessary, in completing 
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configuration of the mature protein. Finally, high performance liquid chromatography 
(HPLC) can be employed for final purification steps. 

The enzymes of the present invention may be a naturally purified product; or a 
product of chemical synthetic procedures, .-or produced by recombinant techniques from a 
prokaryotic or eukaryotic host (for example, by bacterial, yeast, higher plant, insect and 
mammalian cells in culture). Depending upon the host, employed in a recombinant 
production procedure, the enzymes of the present invention may be glycosylated or may be 
non-glycosylated. Enzymes of the invention may or may not also include an initial 
methionine amino acid residue. 

' P-galactosidase hydrolyzes lactose to galactose and glucose; Accordingly, the 
OC1/4V, 9N2-31B/G, AEDII12RA-18B/G and F1-I2G enzymes may be employed in the 
food processing industry for the production of low lactose content milk and for the 
production of galactose or glucose from lactose contained in whey obtained in a large 
amount as a by-product in the production of cheese. Generally, it is desired that enzymes 
used in food processing, such as the aforementioned p-gaiactosidases, be stable at elevated 
temperatures to help prevent microbial contamination. 

These enzymes may also be employed in the pharmaceutical industry. The enzymes 
are used to treat intolerance to lactose. In this case, a thermostable enzyme is desired, as 
well. Thermostable p-galactosidases also have uses in diagnostic applications, where they 
are employed as reporter molecules. 

Glucosidases act on soluble cellooligosaccharides from the non-reducing end to give 
glucose asthe-sole product. Glucanases.(endo- and exo-) act jn the depolymerization of 
"cellulose, generating more non-reducing ends (endo-glucanases, for instance, act on internal 
linkages yielding cellobiose, glucose and cellooligosaccharides as products). p- 
glucosidases are- used in applications where glucose is the desired product. Accordingly, 
M11TL, F1-12G, GC74-22G, MSB8-6G , OC1/4V, VC1-7G1, 9N2-31B/G and 
AEDII12RA1 8B/G may be employed in a wide variety of industrial applications, including 
in corn wet milling for the separation of starch and gluten, in the fruit industry for 
clarification and equipment maintenance, in baking for viscosity reduction, in the textile 
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industry for the processing of. blue jeans, and in the detergent industry as an additive. Eor 
these and other applications, thermostable enzymes are desirable. 

Antibodies generated- against , .the enzymes, corresponding to a sequence of the 
present invention can be obtained by direct injection of the enzymes, into an animal or by 
administering. the enzymes to an animal, preferably a nonhuman: The antibody so obtained' 
: will then .bind the enzymes, itself: In this manner,' even a sequence encoding only a 
fragment of the enzymes can be used to .generate antibodies binding the whole, native 
enzymes. 'Such antibodies can then be- used to'isolate the^ enzyme from cells expressing that 
enzyme. • " . * " • 

' For preparation, of monoclonal antibodies, any technique which provides antibodies ' 
produced by continuous cell line cultures can be used. Examples include thchybridoma 
technique (Kohler. and Milstein, 1975, Nature, 256:495-497); the trioma' technique, the 
human B-cell hybridomaiechnique (Kozbor et al., 1 983 . Immunology Today 4:72), and the 
EBV-hybridoma "technique to; produce human monoclonal antibodies (Cole, et aL 1985, in 
Monoclonal Antibodies and'Cancer Therapy, Alan R. Liss. Inc., pp. 77-96). 

Techniques described for the production of single chain antibodies -(U.S. Patent 
4,946778) can be adapted to produce single chain antibodies to immunogenic enzyme ' 
products of this invention. Also; transgenic mice may be used to express humanized 
antibodies to immunogenic enzyme products of this invention. * >' f 

. Antibodies generated against the enzyme of the present invention may be used in 
screening for similar- enzymes from ; other organisms and samples.' Such screening 
techniques are known, in the art, for example, one such screening assay, is described in 
"Methods for Measuring Cellulase Activities", Methods in enzymology. Vol 160, pp. 87- 
1 16; which is hereby incorporated by reference in its entirety. •' ■ 

The present invention will be further described with reference to the following 
examples; however, it is to be understood that the present invention is not limited to such 
examples. All parts or amounts, unless otherwise specified, are by weight. 1 

In order to facilitate understanding of the following examples certain frequently 
occurring methods and/or terms will be described. 
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"Plasmids" are designated by a lower case-p preceded and/or followed by capital 
letters and/or numbers. The starting plasmids herein- are- either commercially available, 
publicly available on an unrestricted basis, or can be constructed from available'plasmids 
in accord with published procedures. In addition, equivalent plasmids to those described 
are known in the art and will be apparent to the ordinarily skilled artisan. 

"Digestion" of DNA refers to catalytic cleavage of the DNA with a restriction 
enzyme that acts only at certain sequences in the DNA. The various restriction enzymes 
used herein are commercially available and their reaction conditions f cofactors and other 
requirements were used as would be known to the ordinarily skilled artisan. For analytical 
purposes, typically" 1 jig of plasmid or DN A fragment is used with about 2 units of enzyme 
in about 20 ul of buffer solution. For the purpose of.isolating DNA fragments for plasmid 
construction,, typically 5 to 50 ug of DNA are digested with 20 to 250 units of enzyme in 
a larger volume. Appropriate buffers and substrate amounts for particular- restriction 
enzymes are specified by the manufacturer. Incubation times of about 1 hour at 37°C are 
.ordinarily used, but may vary in accordance with the supplier's instructions. After, digestion 
the reaction is electrophoresed direct/ on a poiyacryiamide' gel to isolate the desired 
fragment. • ' 

Size separation of the cleaved fragments is performed using 8 percent 
poiyacryiamide gel described by GoeddeL D.'et al, Nucleic Acids Res., 8:4057 (1980). 

"Oligonucleotides" refers to either a single stranded polydeoxynucleotide or two 
complementary polydeoxynucleotide strands which may be chemically synthesized. Such 
-synthetic oligonucleotides have; no 5' phosphateand thus will not ligate to another 
oligonucleotide without adding a phosphate with an ATP in the presence of a kinase. A 
synthetic oligonucleotide will ligate to a fragment that has not<been dephosphorylated. 

"Ligation'' refers to the process of forming phosphodiester -bonds .between two 
double stranded nucleic acid fragments (Maniatis, T., et aL Id'., p. 1*46}. Unless otherwise 
provided, ligation may be accomplished using known buffers and conditions with "10 units 
of T4 DNA ligase ("ligase") per 0.5 jig of.approximately equimolar amounts of the DNA 
fragments to be ligated. - ;•' 
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Unless otherwise- stated, transformation was performed as described in the method 
. W Graham, F. and Van- d^ 1 * ; /■ 

, u ' ' ■ : . . ' Example 1 • - . . :. ■ >' 

Bacterial E x pression and Purification of Glvcosidase E hzvmes 

■ * DNA encoding the enzymes of the present invention, SEQ ID NOS: 1-14 and 57-60 
were initially amplified from a pBluescript vector containing the DNA by the PGR 
-technique using the primers noted-herein. The. amplified sequences were then. inserted into ; 

• the respective PQE vector listed beneath the primer sequences, and the enzyme was 
expressed according to the protocols set forth herein. The 5' and 3- primer sequences for 

• the, respective genes are- as follows: ; : 

• Thermococcus AEDII 1 2RA - 1 8B/G ; . , ' ' 

; ;5'tCGAGA^^ 

3' CGGAAGATCTTC ATAGC7CCGG AAGCCCATA 5* (SEQ ID N0:30) - 

'/Vector:' pQEl 2; and contains the following restriction enzyme sites .5' EcoRTand 3' Big ; 
OC174V-33B/G . • . v . \ / . "v\ < . 

5 ' CC G AG AATTC ATT/VA^AG AGG AG A A\TT.AACT ATG AT/VAG.^G GTCC G ATTTTC C 3' ' . ' . * .* 

. (SEQlDNO:3l) \" ■. ' , ' ■ ' v ; './ 

^ }' CGGAAGATCTTTAAGATTTTAGAAA1TCCTT 5' (SEQ lQ NO:3-2) • , , - [ ' 

- Vector: pQE12; and contains the following restriction enzyme sites 5' EcoRI and 3' Bgl 
'' II./. . , . ;. ; v.,'.--/-,:; . /iJ.V , ; .-' • \ 

• Thermococcus 9N2 - 31B/G . • vr , • 

5' CC G AG AATTC ATT AAAG AGG AG AAA'IT/^CT ATGCTAC C AG AAGG CTTTCTC 3' . 

(SEQIDNO:33) * • ' • • • : 

V CGGAGGTACCTCACCCAAGTCCGAACTTCTC 5' (SEQ ID NO:34) • 

Vector: pQE30; and contains the following restriction enzyme sites 5' EcoRI. and 3" 
*■ KpnI. 
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Staphy'.oihermm marinus 'Fl - 12G - 

5' CCGAGAATTCATTAAAGAGGAGAAATT/VACTAfGATAAGGTTTCCTGATTAT 3' 

(SEQ IDNO;35) * . 

3' CGGAAGATGTTTATTCGAGGTTCTTTAATCC 5' (SEQ ID NO:36) 

Vector: pQE12; and contains the following restriction enzyme sites 5' EcoRI and 3' Bgl 
II. - 

Thermococcus chitonophagus GC74 - 22G 

S'.CCGAGAATTCATTCATTAAAGAGGAGAAATT^ 3* 
(SEQIDNO:37) 

3' CGGAGGATCCCTACCCCTCCTCTAAGATCTC5' (SEOIONO:38) , 

Vector: pQE12; and contains the following restriction enzyme sites 5' EcoRI and 3' 
BamHI. • ' 

M'llTL * .. . 

5* AATAATCTAGAGCATGC.^-\TTCCCC.-\A-\GACTTCATGATAG 3' (SEQ ID NO:39) 
■3* AATAAA\GC■^rACTGGATCAGTGTA\GATGCT■5 , (SEQ ID NO:40) 

Vector: pQE70; and contains the following restriction enzyme sites 5' SphI and 3' Hind 
III. 

Thermotoga maritima MSB8-6G , 

5' CCGACAATTGATT.UaGAGGAGAAATTAACTATGGAAAGGATCGATGAAATT 3* (SEQ ID NO:41 ) 
3' CGGAGGTACCTCATGGTTTGAATCTCTTCTC 5' (SEQ ID NO: 42) 

Vector: pQE12; and contains the following restriction enzyme sites 5' EcoRI and 3' 
KpnI. --- - - - ... - r - 

' Pyrococcus furiosus VC\ -7GI ^ ■ „ 

5' CCGACAATTGATTAAAGAGGAGAAATTAACTATGTTCCCTGAAAAGTTCCTT 3' (SEQ ID NO:43) 
3' CGGAGGTACCTCATCCCCTCAGCAATTCCTC 5* (SEQ ID NO:44)- 

Vector: pQE12; and contains the following restriction enzyme sites 5' EcoRI and 3' Kpn 
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Bahkia gouldi end'oglucanase (37GP1) 

5\WAAGGATCCGTITAGCGACGCTCGC3;(SEQ'IDNO:45) ' • 

3* ' AATAAAAGCTTCCGGGTTGTACAGCGGTAATAGGC 5' (SEQ ID NO:46) 

Vector: pQE52; and contains the following restriction enzyme sites 5' Bam HI and 3' 
Hind III. 

Thermotoga maritima a-galactosidase (6GC2) 

5'TTTATTGAATTCATTAAAGAGGAGAAATTAACTATGATCTGTGTGGAAATA 

(SEQ IDNO:47) : ; . ,. .. . - . . ; . - - 

3' TCTATAAAGCTTTCATTCTCTCTCACCCTGTTCGTAGAAG 5' (SEQ ID NO: 48} 

Vector: pQET; and contains the following restriction -enzyme sites 5' 'EcoRI and 3' Hind 

in. ! ; 

Thermotoga maritima 6-mannanase (6GP2) 

5' YTTATTCAATTGATTAAAGMGAGAAATTAA 3' 
(SEQ1DN0:49) 

3' TTTATT/\AGCTTATCTTTTCATATlt ACATACCTCC 5' (SEQ.ID NO:50.) ' 

Vector: pQEt; and contains the following restriction enzyme sites 5' Hind III and 3 T 
EcoRI " " ' - " 

AEPIIla fl-mannanase.(63GBl) '.' . 

5' TTTArTGAATTCATTAAAGAGGAGAA-^TtAACTATGCTAGCA 3' ' 

(SEQIDNO.51) ' . - 

y TTTATTAAGCTl'CTCATC/KACGGCTATGGTCTTCATTTC 5' {SEQ ID NO:52) 

' Vector: pQEt; and contains the following restriction enzyme sites 5' Hind III and 3' 
EcoRI. 

OC1/4V endoglucanase (33GP1) 7 ~ ""T : ~ ' : > " 

5' AAAAAACAATTGAATTCATTAAAGAGGAGAAATT^ 

3* (SEQIDNO:53) ' • . 

3' TTTTTCGGATCCAATTCTTCATTTACTCTTTGCCTG 5' (SEQ ID NO:54) 
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Vector: pQEt; and contains the following restriction enzyme sites 5' BamHI and 3' .- . 
EcoRI. . ' 

Thermptoga maritima pullalahase (6GP3) t - 

5' TTTTGGAATTCATT.AAAGAGGAGAAA.TTA.ACTATGGAACTGATCATAGAAGGTTAC 3' 
<SEQIDNO:55) 

3' ATAAGAAGCTnTCACTCTCTGTACAGAACGTACGC 5' (SEQ. ID NO:56) \ 

Vector: pQEt; and contains the following restriction enzyme sites 5' EcoRI and 3' Hind 
III. - , 

The restriction enzyme sites indicated correspond to the restriction enzyme sites on 
the bacterial expression vector indicated for the respective gene (Qiagen/Inc. Chatsworth, 
CA). The pQE vector encodes antibiotic resistance (Amp 1 ), a bacterial origin of replication 
(on), an IPTG-reeulatable promoter operator (P/0) t a ribosbme binding site (RBS), a 6-His 
tag and restriction enzyme sites. 

* The pQE vector was digested with the restriction enzymesindicated. The amplified 
sequences were ligated into the respective pQE vector and inserted in frame with- the 
sequence encoding for the RBS. The ligation mixture was then used to transform the E. coli 
strain M15/pREP4 (Qiageh, Inc.) by electroporation. M15/pREP4 contains multiple copies 
of the plasmid pREP4, which expresses the lad repressor and also confers kanamycin 
resistance (Kan"). Transformants were identified by their ability to grow on LB plates^and 
ampicillin/kanamycin resistant colonies were selected. Plasmid DNA was isolated and 
-confirmed by restriction analysis.. .Clones containing.the desired constructs were grown 
overnight (O/N) in liquid culture in LB media supplemented with both Amp (100 ug/ml) 
and Kan (25 ug/ml). The O/N culture was used to inoculate a large culture.at-a ratio of 
1 :100 toT :250. The cells were grown to an optical .density 600 (CXD. 60 ?) of between 0.4 and 
0.6. IPTG - ("Isopropyl-B-D-thiogalacto ' pSTanoside") was then added to a final 
concentration of 1 mM. IPTG induces by inactivating the lad repressor, clearing the P/O 
leading to increased gene expression. Cells were grown an extra 3 to 4 hours. Cells were 
then harvested by centrifugation. 
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-The primer sequences set put above may also be employed to isolate the target gene, 
from the deposited material by hybridization techniques described above. . 

Example 2 

Isolation of A Selected £lone From the Deposited genomic clones 

■ A clone' is isolated directly by screening the deposited material s 

- oligonucleotide primers set forth in Example 1 for the particular gene desired to be 
•isolated. .The specific oligonucleotides are synthesized using an Applied Biosysiems : . 
DNA synthesizer. The oligonucleotides are labeled with 32 P-'-ATP using T4 
polynucleotide kinase and purified according to a standard protocol (Maniatis et ah, 
Molecular Cloning: A Laboratory Manual, Cold Spring HarborPress, ? Cold Spring, NY V 
1982). The deposited clones in the pBluescript vectors may be employed to transform 
bacterial. hosts which are then plated on. 1 .5% agar, plates. to the density ^of 20,000- 
50,000 pfu/150 mm plate- These plates are screened using Nylon membranes according 

. to the standard screening protocol (Stratagene, 1993). . Specifically, the Nylon , , , 
•membrane with denatured and fixed T^NA is prehybridized in 6 x SSC, 20 mM 
NaH,P0 4 , 0.4%SDS, 5 x Denh'ardt's.500 ng/ml denatured,, sonicated^salmon sperm 
DNA;. and 6 x SSC, 0.1% SDS. After one hour of prehybridization-, the membrane is 
' Hybridized with hybridization buffer 6xSSC, 20 mM NaH 2 PO,, ,0.4%SDS; 500 .ug/ml. 
denatured, -sonicated salmon sperm DNA with 1 xl 0 6 cpm/ml 32 P-pr6be overnight at 
42°C. The membrane is washed at 45-50 °C with washing buffer 6 x SSC, 0,1% SDS 
for 20-30 minutes dried and exposed to Kodak X-ray filnvovernight. Positive clones are ' 
isolated and purified by secondary and tertiary screening. The purified clone is 
sequenced to verify its identity to the primer sequence. 

- • Once the clone is isolated, the two oligonucleotide primers corresponding to the 
gene of interest are used to amplify' the gene, from the deposited material. , A polymerase 
chain reaction is carried out in 25 ul of reaction mixture with 0.5 ug of the DNA of the 
gene of interest. The reaction mixture is K5-5 mM MgCU, 0.01% (w/v) gelatin, 20 fiM . 
each of dATP, dCTP, dGTP, dTTP, 25 pmol of each primer and 0.25 Unit of Taq 
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polymerase. Thirty five cycles of PCR (denaturation at 94 °C for 1 min; annealing at 
55 °C for 1 min; elongation at 72 °C for 1 min) are performed with the Perkin-Elmer 
Cetus automated thermal cycler. The amplified product is analyzed by agarose gel 
electrophoresis and the DNA band with expected molecular weight is excised and 
purified. The PCR product is verified to be the 'gene of interest by subcloning and 
sequencing the DNA product. The ends of the newly purified genes are nucleotide 
sequenced to identify full length sequences. Complete sequencing of full length genes is 
then performed by Exonuclease III digestion or primer walking. 

Example 3 
Screening for Galactosidase Activity 

Screening procedures for a-galactosidase protein activity may be assayed for as 
follows; 

Substrate plates were provided' by a standard plating procedure. Dilute XL1- 
Blue MRP E colt host of (Stratagene Cloning Systems, La Joila, CA) to O.D. 600 =/l .0 
with NZY media. In 15 ml tubes, inoculate 200 iA diluted host cells with phage.- Mix 
gently and incubate tubes^at 37 °C for 15 min. Add approximately 3.5 ml LB top 
agarose (0.7%) containing ImM IPTG to each tube and pour onto all NYZ plate surface. 
Allow to cool and incubate at 37 °C overnight. The assay plates are obtained as 
substrate p-Nitrophenyl a-galactosidase (Sigma) (200 mg/100 ml) (100 mM NaCl, 100 
mM Potassium-Phosphate) 1% (w/v) agarose. The plaques are overlayed with 
nitrocellulose and incubated at 4 °C for 30 minutes whereupon the nitrocellulose is " 
removed and overlayed onto the substrate plates. The substrate plates are then incubated 
at 70 °C for 20 minutes. - 
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. " Example 4 

Screening of Clones for Mannanase Activity , 

.'. A solid phase screening assay was utilized as a primary screening method lo test - 
clones for fi-xannanase activity. 

'". a culture solution of the YJ 090-£. coli host strain (Stratagene Gloning Systems, 
La Jotla/CA) was diluted to O.D. 600 =1 0 with NZY media. The amplified library from 
Thermoioga mdritima lambda gill library-was diluted in SM (phage dilution buffer): 5 
■x 10! ,pfu/ul, diluted- 1 : 10.00 then 1:100 to 5 x 10 : pfW^l. Then 8 ul of phage dilution 
(5 x 10 2 pfu/ul) was plated in„200 ul host cells. They were then incubated in 1 5 ml 
tubes at 37 °C for 15 minutes. 

Approximately 4 ml of molten, LB top agarose (0.7%) at approximately 52 °C 
was added to each' tube and the mixture was poured onto the surface of LB agar plates. 
The agar plates were then incubated at 37 °C for five hours. The'plates were replicated * 
and induced with-10 mM IPTG-soaked Duralon-UV™' nylon membranes (Stratagene ■ • 
Gloning Systems. La Jolla, CA) overnight. The nylon membranes and plates were' . 
• .-marked with a needle to keep their orientation and the nylon membranes were. then. . 
removed and stored at 4 °C. ' ' . ' • -' 

An Azo-galactomannan overlay was applied to the LB plates containing the ' • 
lambda plaques. The overlay contains 1% agarose. 50. mM potassium-phosphate buffer 
pH 7, 0.4% Azpcarob-galactomannan, (Megazyme, Australia). The plates were, 
■ incubated at 72 .?C. The Azo'carob-galactomannan treated plates were observed after 4 
hours then; returned to incubation overnight. Putative positives were identified by . 
clearing zones on-the Azocarob-galactorriaiinan plates. Two positive clones were 

observed. " • 

-The nylon membranes referred to above, which correspond to the^ositive..clones 

were retrieved, oriented over the plate and the portions matching the locations of the 
clearing zones for positive clones wre cut out. Phage was eluted from the membrane 
cut-out portions by soaking the individual portions in 500 ul SM (phage dilution buffer) 
and 25 ul CHC1 3 . 
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. • Example 5 

' Screening of Clones for Mannosidase Activity 

A solid phase screening assay was utilized as a primary screening method to test 
clones for B-mannosidase activity.' 

A culture solution of the YJ 09Qt£ coll host strain (Stratagene Cloning Systems, 
LaJolla, CA) was diluted to O.D.^LO with NZY media. The amplified library from . 
AEPII lalambdagtll library was diluted in SM (phage dilution buffer): 5 x 10 7 pfu/|il 
diluted 1:1000 then 1 : 100 to 5 x 10 2 pfti/|il. , Then 8>il of phage dilution . 
(5 x TO 2 pfii/fil) was plated in 200 |il host cells. They were then incubated in. 1 5 ml 
tubes at 37 °C for 15 minutes. t , \ 

Approximately 4 ml of molten, LB top agarose (0.7%) at approximately 52 °C - 
was added to each tube and themixture was poured onto the- surface of LB agar plates. 
The agar plates were then incubated at 37 °C for five nours. The plates were replicated 
and induced with 10" mM IPTG-soaked Duraion-UY™ nylon membranes (Stratagene 
Cloning Systems, La Jolla, CA) overnight. The nylon membranes and plates were, 
marked with a needle to keep their orientation and the nylon membranes were then 
removed and stored at 4 °C. . ' u 

A p-nitrophenyl-B-D-manno-pyranoside overlay was applied to the LB plates . 
. containing the lambda plaques. .The overlay contains 1% agarose, 50 mM potassium- 
phosphate buffer pH 7, 0.4% p-nitrpphenyl-B-D-manno-pyranoside. (Megazyme, 
. Australia). The plates were incubated at 72 °C. The p-nitrophenyl-B-D-manno- • 
_pyranoside treated plates were observed after 4 Jiours jhen returned to incubation ■ _ 
overnight. Putative positives were identified by clearing zones on the p-riitrophenyl-B- ■ 
D-manno-pyranoside plates. Two positive clones were observed. 

The nylon membranes referred to above, which correspond to the positive; clones 
were retrieved, oriented over the plate and the portions matching the locations of the t . 
clearing zones for positive clones .wre cut out. Phage was eluted from the membrane 
cut-out portions by soaking the individual portions in 500 jxLSM (phage dilution buffer) 
.and 25 nl CHCi 3 . 
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Example 6 
Screening for Pullulanase Activity 

■ '■ • ' Screening procedures for pullulanase protein activity may be. assayed for as 

; follows: \ . \ ,/ ■ _ ' * . ;"- ,! / • • • • 

: Substrate plates were provided by a standard plating procedure'/ Host-ceils are ' 

. diluted to O.D. 6C0 = 1.0 with NZY or appropriate media. In. 15 ml 1 tubes, inoculate 200 • , 
Ail diluted host cells with phage. Mix gently. and incubate tubes at 37 °C for 15 min. ■ 
Add approximately 3,5 ml LB.top agarose (0.7%) ( is added to each tube : and the mixture 
is plated, allowed to cool, and incubated at 37 °C for about 28 hours. Overlays of 4.5 ' 
mis of the following substrate are poured: >' •;' . ,• r ' ' 

• 1 00 ml total volume ■ ' • ... ' ■ .'-^ - -,';< ♦ ' '> V > 

\ 0.5g • Red Pu.llulan Red (Megazyme. Australia) 

l.Og ' . Agarose . , , • • •• > .', . . :\ 

. 5ml ' Buffer (Tris-HCLpH 7:2 @ 75 °G) J/', ^' v ^ • 

. 2ml • ' 5MNaCl . " ' '.\ : . : ' \ " ' ; .' 
; , 'Sml^ 'CaGK (lO.OmM) ' ' / : . ; . ' *. ; ; 

/• 85ml • ■ dH 2 0 • ; ' ; - ' \ ■ " ' 

. Plates are cooled at room temperature, and thenm incubated at 75 °C for 2 hours. 
Positives are observed as showing substrate degradation. - ^ 

; ( . • Example 7 ■ > ' ; . , ; \ 

Screening for Endoglucanase Activity ! . 

I Screening procedures, for endoglucanase protein. activity may be -assayed for as -, ! 

"follows": ' ' -y~* t --*> : ^. : - a - " ■-:*S- , v'-"_--;_ • ■ v; " " - ^■■■.^ : ; - * - _ 

1 . ' . The gene library is plated onto 6 LB/GelRite/0.1% CMC/NZY agar plates 
(-4,800 plaque forming units/plate) in E.coli host with LB agarose as top agarose. The 
plates are incubated at 37°C overnight. 
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2. Plates are chilled at 4 C C for one hour. 

3. The plates are overlayed with Duralon membranes (Stratagene) at room 
temperature for one hour and the membranes are oriented and lifted off the plates and 

■ stored at4°C. 

.4. The top agarose layer is removed and plates are incubated at 37°C for -3 

hours! 

5: The plate surface is rinsed with NaCl. 
' 6. The plate is stained with 0.1% Congo Red for 15 minutes. 

7. The plate is destained with lMNaCI. 

8. The putative positives identified on plate are isolated from the Duralon 
membrane (positives are identified by clearing zones around clones). The phage is 
eluted from the membrane by incubating in 500(il SM + 25]iTCHCl 3 to elute, 

9. Insert DNA is subcloned into any appropriate cloning vector and 
subclones are reassayed for CMCase activity using the following protocol; 

i) Spin 1ml overnight miniprep of clone at maximum speed for 3 . 

minutes. 

ii) Decant the supernatant and use it to .fill "wells", that have been 
made in an LB/GelRite/0.1% CMC plate. 

* iii) Incubate at 37° C for 2 hours. 

iv) Stain with 0. 1 % Congo Red for 15 minutes. 

v) Destain with 1 M NaCl for 1 5 minutes. 

vi) - Identify positives by- clearing. zone, around clone. _ ; 

Numerous modifications and variations of the present invention are possible in 
light of the above teachings and, therefore, within the scope of the appended claims, the 
invention may be practiced otherwise than as particularly described. 
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WHAT IS CLAIMED IS : 

" I,. , An isolated. polynucleotide selected from the group consisting of: ■ 
;..(a) SEQ ID NOS: l-14and 57-60;/ 
' " (b) SEQ. ID NOS: 1-14 and, 57-60 ? wherein T can also be U; 

" (c) .' polynucleotide sequences complementary to SEQ ID NOS: 1-14 and 57- 

(d) polynucleotide sequences -whieh-encode an amino acid sequence as set 
... L forth in SEQ ID NOS: 15-28, and 61-64; and . ... : _ : ^ , 

(e) fragments of (a), (b), (c) or (d) that are at least 15 consecutive bases in 
• - . 4 length and that will selectively-hybridizett) DNA which encodes a 

polypeptide of SEQ ID NOS: 15-28, and,61-64. ' ' 

2.. . A vector comprising a polynucleotide of claim 1 . 

• ; 3 : . A host cell containing the- vector of claim 2: ■ - 

" ,'- 4. ' The method of claim 3, wherein-the host cell is a eukaryotic cell. 

' 5. The method of claim 3, wherein the host cell is a prokaryotic.cell. 

' .6. A method for producing a polypeptide comprising: . 

(a) culturing the host cells of claim 3; •. • v 
" ■ ; (b) expressing from the host cell of claim3 a polypeptide, encoded by said 
■ * , * polynucleotide; and - • .,, . 1 

(c) isolating the polypeptide: . . -" ' V : ' : ' 
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7. "An enzyme selected from the group consisting of: ; ■ , 

(a) an enzyme comprising an amino acid sequence set forth in SEQ ID NOS: 
.15-28 or 61-64; and 

(b) * an enzyme which comprises at least 30 consecutive amino acid residue as 

an enzyme of (a). 

8. An enzyme of which at least a portion is coded for by a polynucleotide of 
claim 1 ; and which is selected from the group consisting of: 

(a) an enzyme comprising an amino acid sequence which is at least 70% 
identical to an amino acid sequence selected from the group of amino 
acid sequences set forth in SEQ ID NOS: 15-28 or 61-64; and 
* (b) an enzyme which comprises at least 30 amino acid residues to the. 
enzyme of (a). 

9. * A method for generating glucose from soluble cell oligosaccharides comprising 

contacting a sample containing oligosaccharides with an effective amount of an 
enyzme selected from the group consisting of an enzyme having the amino acid u - 
sequence set forth in SEQ ID NOS: 15-28, 61-63 and t4 such that glucose is 
produced. 

10. The method- of cliam 9, wherein the sample is selected from the group consisting 
~- of dairy products,- fruit juices, detergents, textiles, guar gum, animal feed,.plant 

biomass and waste products. 

11. The method of claim 9, wherein the oligosaccharide is selected from the group 
consisting of maltose, cellobiose, lactose, sucrose, raffmose, stachyose, 
^verbascose, cellulose, starch, amylose, glycogen, disacharrides, polysacharrides 
and pullulan. 
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MUTL CLYCOSIDASE - 2 *G ^ 

COMPLETE GENE SEQUENCE - 9/95 
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. Clu AiVArg.^r Val Uu Zl.Z ^' 

' 1575 . 1584 1593 1602 

CTO. AAA A^TOC.tOO AAC AGC ^A ACC TOG.CAG ^'-CCA CAG «c*38 TCA'CCT^ 
V.1 Lys A.„ Tr P Trp Asn S.r Gly Thr Trp ^ ~; £ ~; I. 

• '■ Figure ilO( Continued) 
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a. «. «, ol „ •„ 01y ^ ~ - ~ -;- ~ - --- --- _ 

=c_= ^£ « «■£ „ ^™ A(a gia ™ ^ ^ ^ 

- «y s .„ « p T „ „. clu v ," ~; » » ~ ~ --- ._. ... 

1737 1746 17« 

~ ~- » « « ■« « « « CCA ^ M _™ 

«. «. « ». u. «. s: £ s: ;i: - - 

1791 1800 Ifloo 

««. « « «»»« 

Lys Gly Arg Leu Arg Pro Tvr Al* v*i r T 

w «tto Tyr Ala Val Leu Asn Pro Gly Trp Val Lys lie Gly 

1845 1S54 iofi-1 

" ?f ™ ^ "= «= "<= <=» «T «C ^'K ^ ^ 

~ - *~ « ; - «• - = n: s £ »; 

1959 1S0S 1317 1Q 2 C 

«„ ryr ^ u . w ^ ~ »; - ~- ~ -.. ... ... 

1353 1962 iQ7i 

«. ^ ,i. u. Cly M vsl sl ; - ~- ~~~ - --- --- 

2007 2 °16 2025 ' 

CW TAT J AGA ACA GGA GGT ATG^TGA 3. 
Phe lie Asp ash V.l Ar g Leu ^ £ ^ ~ ~ ~ "~ — 



Figure lkl (Continued) 
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mi1 lo . P-aanaooidaoo (63QB1) 
" •~ 5 S S 3*'^~ ~ ™ *» TCA. « » c « w 0 » 

,„ * • — ~- ~ 

.J. ==c .2 -..r3V« J? « c «c ^ 

:~ c " lvi *» **• ->• £ * ~ £ -5 5 = 

«. ai ly . L „ ~ ~ ~~ - ~ ~ --- 

«C COC «e A*C ^ TA C C „ Tf? ^ ^ «C ™ „ ^ £ 

-v n e ,, ta ^ clu uu ph . 0 -- ~- ~- --- 

err « a t « ra „ ^ ra c „ ™ ak ^ £ _„ cc . ™ 

01y ^ fa r - " ; - £ - £ s: ^ s . . 

279 288 2S7 

S ^ Trr *f? T. S *"= « « « £ ra ,« £. ra . mA ^ 

- ss£ wis sj. : - 



„. ^ s „ ™ ~ ~~ ~. ~ 

M ? ** - *™ « •s.f.s s£ s ~ - ~ 

~ ?!? !!? ?!? - ™ « « ™ «o ewiS ■« TO £ „ c „ c «< 
v " *~ L " ***'..'"■ - 

495 , S04 513 - ,^ - - - • 

™~5!«««««»«C«C S ATC ==0 S ■« TCC SS 

». v., «. L . u ^ - = ^ ... ... ... 

, Figure 120-, • 
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la JJ-nannooidaoo (S30B1) (continuod) 

549 558 567 576 585 504 

ACG ACA CTT CTT GAG TTT CCC AAG TAT CCT GCT TAC ATC GCC CAT CCG CTC GCA 

Arg Thr Val Val Glu Phe Ala Lys - Tyr* Alt Ala Tyr lie All His Ala Leu Cly 

603 612 621 630 639 64S 

GAC CTC GIG GAC ACA TCC AGC ACC TTC AAC GAA CCT ATG GTA GTT GTG GAG CK 

Asp l*u Val Asp Thr Trp Ser Thr Phe Asn Glu Pro Met vll Val Val Glu 

657 66fi «75 684 693 702 

™ ™ !f! T C ^ ^ 000 GTC ATG ^ C C CC GAG GCC 

Cly Tyr Lou Ala Pro Tyr Ser Gly Phe Pro Pro Gly Val Met xln Pro Glu Ala 

711 720 729 738 747 75fi 

CCG AAG CTC GCG ATC CTC AAC ATG ATA AAC CCC CAC GCC TTC GCA TAT AAG ATG 

Ala Lys Leu Ala lie Leu Asn Met Ilo Asn Ala His Ala Leu Ala Tyr Lys Met 

765 774 . 783 732 801 g< 0 

ATA AAG AGG TTC GAC ACC AAG AAG GCC GAT GAG CAT AGC AAG TCC CCT GCG GAC 

lie Lya Arg Pha As? Thr Lys Lys Ala Asp Glu Asp Ser Lys Ser Pro All 'fill 

819 82 8 837 846 855 864 

GTT GGC ATA ATT TAC AAC AAC ATC GGT GTT. GCC TAC CCT AAA GAC CCT AAC GAT 

Val Gly lie lie Tyr Asn Asn lie Cly Val Ala Tyr Pro Lys Asp Pro As^ k& 

873 882 891 900 909 918 

CCC AAG GAC GTT AAA GCA GCC GAA AAC GAC AAC TAC TTC CAC AGC GGA CTG TTC 

Pro Lys Asp Val Lys Ala Ala Glu Asn Asp Asn Tyr Phe His Ser Gly Leu Phe 

527 936 945 " ' 954 963 972 

TTT GAT GCC ATC CAC AAG GGT AAG CTC AAC AT?A GAG TTC GAC GGC GAA AAC TTT 

Phe Asp Ala lie His Lys Gly Lys Leu Asn lie Glu Phe Asp Gly Glu ten Phe 

... .981 990 - 999 - 1008 - 101T ~ ~ 1025 ' 

GTA AAA GTT AGA CAC CTA AAA GGC AAT GAC TGG ATA GGC CTC AAC TAC TAC ACC 

Val Lys Val Arg His Leu Lys Gly Asn Asp Trp Ilo Gly Leu Asn Tyr Tyr Thr 

1035 1044 1053 1062 1071 . 1080 

CGC GAG GTT CTT AGA TAT TCG GAG CCC AAO TTC CCA ACT ATA CCC CTC ATA TCC 

Arg Glu Val Val Arg Tyr Ser Glu Pro Lys Phe Pro Ser lie Pro Leu lie Ser J 
Figure 12b( Continued ) 
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^ **>ZX la P-nannooidaoo ,„i„, (continued, - ^ 
" l "89 1098 Hoi- , , , r 

■ ■ ~ ~ S!S5 j* « «*« ^ws. 

! 1143 1152 ' " lifii '• 

1157 , 1206 ; , His • i V*} A ' 

- *^s., np 1;u » u , ^ - - 

' iillfhsiss! w * K ™ to ~£ *~ cxc'SI 

■ 1305 ' 1314 1323 * 

••3 £ »~ £ £ «• * — ~1S = M Wig 

. ^„ v ^ ~ ~. ~ - ~ £ ; 

1359 . ■ . ■ 1368 1377 * J ■] 

^ ^ ^^ ^ exu^^ ^ ^. - ~£ . 

1413 '■' 1422 " ■ * r 143V v "' 'V' ; * 

^ ly , vu ^ Uu n . s „. - --- ^ ... ... 

■ E S ™ S « & « * ^ 

»« n. t* «, I1 ,;.v. 1 a Wc ;;.~; --- ^ . 

..,'1521 - ■ 1530. . 1539 "■• . , . 

GAG TTC CTG AAG'GGT GAG GAG AAA TCA 3 • ' ' ' • 

Glu Phe Leu Lys Gly Glu Glu Lyo ' ' ' ' . : • . • -„ . " ' : 



Figure 12C( Continued)' 
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~ ~ 5 £ b « ~ ■*» - '« «»~ £ 
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■» «. S« „ E „ A „ - ra Are - „ « £ w « 

U. U. =1, *. „ ^ v „ „. - ~ - - ~ --- ... ... 

« « CAA CAA „ C „ AAA TAC GGC GAT GTT TTG CTG GAA ATT TO3 aqa CAC 

u. ^ M . P „ a, p ^ 0ly - o;; ^ ~ - - --- 

495 504 ' . 5 n \^ 

^ ^ ™ ™ - - TAC CCC GAA AAT CTb TTU TTT GAA ATC „c HI 

"* "* L " - ^ w ^ - - - - - s w 

Figure ,13£U 
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0C1/SV BndoglBcoaaoo (330P1) (contiauodl 
549 SS8 567 576 585 

!" -f! !*! ^ ^ ™ **C. GtX.-CTT TAT CCA AAA £i 

Glu Pro Ala Gin Asn Leu Thr Ala.Glu Ly. Trp Ai^Ala Ze"u T^r Pro. Lys vll 

603 .• 6i2- .' r .621, , 630 • fi34 : . • : 

.f!- ™ !!! ^ ^.'^ CCA ACC.CGG ATT GTC ATT ATC GAT GCT CCA 

Leu,Lys Val lie Arg Glu S« A.n-:p„ Thr. Arg lie vll ill iH Zl Ala Pro 

657 V ■ ' 6 « - . 675- . 684 693 7n , 

™ !^ ™ ! W ^ AG * AGT 5" ™ GTC **C GAC AAA C« .' 

■1 Ain ; Trp Aia .Iiis .Tyr Ser Ala' Val.Arg .Ser -L-u-Ly.^u"-^! Asn As"p illZ'g 

■ ' 711 . . 720 729. , . 738. ' ' 747 75fi 

ATC ATT GTT TCC :TTC CAT TAC " TAC GAA, CCT TTC AAA TTC ACA CAT CAC CCT GCC 

lie Ile Val Ser Phe His Tyr Tyr Glu Pro Ph. Ly. Phe Thr all oil Gly HI ' 

755 ' 774 783 .. 792 801 ' aiQ 

GAA TOG GTT" AAT-CCC ATC CCA CCT GTT AGO GTT AAG TGG AAT..GOC GAG GAA TCG. 

Glu. Trp Val Asa Pro II. Pro Pro; Val Arg Val Lys Trp Asn',Gly Glu Glu Tr?" ' 

' ' 813 .828 *37. 846 3 55 fiS i 

GAA ATT AAC CAA ATC AGA AGT CAT TTC AAA TAC GTG AGT GAC TGG-GCA AAG CAA 

Glu He Asa . Gin II. Arg Ser His Phe Lys Tyr Val Ser Asp' HI ^ Gin 

. ' « 7 3 882 891 900 909 ' 918 

AAT AAC GTA CCA ATC TTT CTT GGT GAA TTC GGT GCT TAT TCA AAA GCA GAC ATC 

- Asn Asn Val Pro lie Phe Uu Gly Glu Phe Gly Ala Tyr Ser Lys Ala Asp Hee 

937 ; 936 ' • 9«S v : 954 9 63 S72 

GAC TCA AGO GTT AAG. TGG ACC GAA AGT. GTG AGA AAA ATG GCG GAA GAA TTT GGA 

Asp Ser Arg Val Lys Trp Thy, Glu Sex ; Val Arg Lys Me c All Glu Glu Phe ciy . 

981 : 999 : 999 i008 1017 1026 

TTT. TCA TAC GCG TAT TGG GAA TTT TGT GCA GGA TTT GGC ATA TAC GAT AGA TGG , 

Phe Ser Tyr . Ala .Tyr Trp Glu Phe Cys Ala Gly Phe Gly. II. Tyr, A^p Irg Trp 

1035 -. 1044 ■■ 1053 1062 ' : i 0 71 1080 

TCr CAA AAC TGG ATC GAA CCA TTG CCA ACA GCT GTG GTT GGC ACA GGC AAA GAG 

Ser Gin Asn Trp He Glu .Pro Leu Ala thr Ala Val Val Gly Thr Gly Lys Glu 
TAA 3 • 



Figure 13b (Continued 
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' ^ ^ S! - «.«S-C - TGO CAG GCA AAA 

M 6C A SP Thr Ly , ^ Gly ne ne . - ~ ----- 

C*^ ^ GAG ATA AAA GAC GGA AAG GCT GAA GTG £ 

, -p v.,^ .-pm,, a . Ciu ;;; c ^ - - 

S^ ^^- f ~ - TTC TAC GAA AAA CCA GAC ACA W C CC £ 
^ " 8 - V - £ S £ - ~ ^ 

171 ISO ' IP, 

«* -».- = « »- ; «• £ & a = £ 



225 234 243 n„ 

~: ~ ^ ~ ~ « ^ « « « « JS ^ JS „ « ™ 

~ £ Thr T 1 " l " «« - ^ ; " £ = s z; z ~ 

£ - sat ccc £ «„, £« TO>w i» 

». »» s„ WB v., b. Lx ; z: z; ;l: ^ m - - 



~ 2 *» S « "» « « g 

**' va - *" :1 " v " " u s " «? s « =3 £ 

387 396 -'■ , n 

~ ^ - ™ ™ "-"^--taa^^s « 

y.. «„. u. «. x.. «. „ ^ z: « £ - ~ ~ : 

..-441-,- 450- . 459 — - — ~ - - -t~ — r -.--.: 

CTC GAC CAC TAC. TAT TAC GAT GGA GAG GTC GGA J£ GTA TAT £ CCA GAG I£ 

LeuAapAspTyrTyr ^ A - ci * «- »; ^ 

-- G ^ ^ !!= «3 S GTA AAG ^CTT.CTC TTC 

Thr . Ile Ph« Axg Vftl Trp Ser Pro Val clr'T" ""~ " 

• • Val Ser Lys ^HP v ai L ys Val Leu L«u Phe 

Figure 14^-' 
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Tho^otogo Witinc, *xllula*aoo ( ff0 P3, (continued 



549 



AAA JW .uWS'Aj CCc"c CAG" GTT CTG AAC ATG GAA TAC AAG ^ 



603 



612 



A 3 „ Gly VaX.^Glu Aaa Vai Val ^ S^g ^.jii; £ £ ~ ^ 

657 * 666 675' c*a c " ' 

2 ^ ^ « ™«C ACA ACA ACC'GTC CAT S TAT TCG..IaA 

Tyr Gin Leu Glu Asn Tyr. 



, 711 720 
CCG CTT TAC GCA AAC AAC 

Ala Val Tyr Ala Asn Asn 



■Oly Lys ' lie Arg Thr Thr Val Asp ,Ko : i£ ' Ser- iys': 

729 738 747 \ 75S 

CAA GAG AGC CCC GTT GTC. AAT CTT GCC AGO ACA AAC 

Gin* Glu'Ser Ala Val Vol Asn Leu Ala Arg Thr Asn 
765 *' 774 , 783 79 V ' ort , 

' V ™ ^ S ^ ^ GGA.CCG AAA ATC GAA GCA J£ GAA GAC IcS 

' Pro Glu Gly Trp Glu,A» Ks 9 Z' g Gl'y III j£ .uj olu" sly T^ gIu ^ Ma. . 
819 ' 828'. 837 84S " - ee ' -.. 

' -I- *!f 2 ™ ^ « C ^AC ATC ACA GCA CTC GAA AAC- TCC GGG Ut ' 

. ii. tvx oiu n-.Hio.-ii; aI; s^j-e waw.'^'ii';:: • 

873 ; " 882 891 900 * . 

: ~ £ ™ 2* « J*** GAA GAA AAC' ACQ Z GCA CCG SI 

- ;-Lys Asn ty 8 Gly I^Tyr Urn G^y W T^ ~' ^ ^^£>^. • 

927 '936 ' ' 

: ;' -I !!! !^ ^.£'2?!2 ^ «M <** ^.<^ «T. jS CAC GTT. CAT ' 
Gly Val Thr Thr. Gly Uu S*l Hia ul V.I Gliuu cly Val T^.' Hi! vli- ^ , 
981 ; " 990 999 1 nno 

: 5.™ - "c^crc-^ 

"•' 118 ^ pro Pha ciy as^gIu 

1035 1044 - 1053 ' m-», 

. AAG TAC ™. AAC g jAC^T 9CT WS^ £ .io^.: , 

. Ly, Tyr Tyr>„ Trp Gly Tyr A^p, Pro Tyr Uu ^ M«' vll' P« gIu ITy Arg; ^ 

Figure 14b( Continued) 
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Thornotoga naritina 

£ z~ = ~~ - - ~ ~ *» J- ^ 

- ~ „ « ,„ Ul = - ~ - ...... 



1143 U52 

« « « ^ « «. « „' £ COT ««S „ OAC^ATc' otg TTC^CCT 

1137 1206 t51 . 

His T J £ ;z ::- ~ - ~ ?*! 2? J? r° CCG tac 

JV* aiy m 01y 0iu tou s „ Aia ^ w ^ v - ; ... ___ 

~ *« -C 1 ^ ACA GCT^ TAT ^ GAA AC3C ^GGA TGT 

p. _ , ■ — AQ C GGA TGT G3T AAC 

Ph« Tyr Ar S II. Asp Ly 3 Thr Gly Ala Tvr , 7 " — — 

nr Gly Al. Tyr L* u Asa 01u s „ My ^ 6 ^ n 

1305 1314 n,-, 

™ ™ « « « « ccc „ «J » lTJ ^ ^ ^nso 

- n.^. s „ a . ^ ,„ - - --. ~ --- --- _ _ _. 

« ^ c^ « „ - ^ ra - ^ ^ w ^ 

•» « «. *. M , m - - -■- - ~- --- _ ... _: 

1413 1422 lAii 

-3! ~ « « » ■« ^ ~ « « ^ „ „«K - 

"* A "° lys - -* - - - - - - £ s; z 

1467 i 47fi 

«T AT, ATT CTC TAC «C GAA ccc'S GGT ^ w ^ ATC A*/ 5 " 

------ «w wua CCG ATC AGG TTT 

mn. ^ ^ Ky S1 „ Pro Trp oly ^ ^ £ ~ ~" = - = 

«» ^M. ^ V.. ^ aly ^ ~ ~ - - = - _ ... ... 

Asp Ala Ha Arg Gly Ser Val pv,* . ^ - --- 

V.X Phe A.„ Pro Ser Vai Lys Gly v ^ 

Figure 14C(Continued) • 
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Thorootoga naritlna *,il«?a.«, 0 . ( So „, {coatlauod 
- 16 - 2 - 9 .. 1638 "47 1656 I6<5 



OCA TAC C=A M ? CAA ACC AAG A TC AM aGgWS G „ cca"^ ATA ^ 

•/ Gly cly Lys z s« n;;;,; £ v 
• ;- p 01y Lys Leu rle ^' s ~ *-^ _^ou Asp Pro ^ ^ • 

v cca aaiZ cac^aac t^S ccc ccc™ 

1791 . 1800 1809 ' iflio 

~ !!! AAA AAC^C CAC AAA^CTG 

Ala Asp Lys Lys.Lys, Glu .Trp ^clu £ tilu Leu Lys' Asn aI. L£ Leu ' 
1845 1854 , 1BS3 ^73 too. 



Gly Gly Gin 



■f ' Mar Gly : Xaa Ile Lau ser cm ci y va ^' Phe ^ >L~s 

■ 189 . 9 1908 ' ' i9 17 - 152fi \ ' e 

- .Sf ™ 3 ^ ™ ™ ^ ™= ™ £ TAC AAC^C CCT ATC^TCG 

' Asp Phoney. Ar S Thr Thr A.» Asp A^ III £ A «. m 

1553 ' 1962 1971 • " , oan ' 

... 'tl^ T-S^: *±~w*\c*&*» a*™ ™ aa T "ac 

', 2007 ' 2015 . . 2025 ■ ^cna - - 

^ -A^cl" GCT TTC AGG CTG AAA^AAC 

1 His Lys Gly Leu lie Lys Leu' A^s Lys CI u & 'Z Zl Phe ^ ^ l^.a^ 
2061 • 2070 2079 2088 ' ' ,„ e , 

.!!! ^ 2?.-*!!:^ ^ ^ S GAA- TTT CTC CCC GCC GCG AGA AGA ATA'^GTT 
Al. Glu Glu lie; Lys Lys His Le« clu Z :Ua >^ Cly cly ^ A^ vli 
2115 2124 2133 - 

; :!!! ^ !!^^ >? 3 ^ccr ^ - ^ ^ ATC GTC 2 ^ 

Ai- Ph. Met Leu Ly, Asp HI. Ala Gly bly Asp ^ L y 0 As'p ni' ^ vll 

' " • Figure 14d( Continued) 
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2169 2178 21 »? 

« « «, « „c „ ^ CTO ^ „ 
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Figure. 15. THemoto?* mar iti,a MSB8 (Clone > 6GP2) Glycosida^e 

1 • • "... ■ ' v . ' , ' ' ' ' 

CTTvTTA ^ ATC GTT GAG CTC TCT TTC GTT C7C TTT GCA ACT GAC GAG TTC ' > 
^Leu Leu.IXe va! Glu Leu Ser ,h. Val Leu Phe Xia.Ser .Asp «» ^ > 

,dTO .AAA GTG GAA AAC GGA AAA TTC GCT CTG AAC GGA AAA GAA TTC AGA TTC ' 

■ Val Lys V,l clu. Asn Gly,Ly,,h, Ala Leu Asn Gly Lys cl u Ph , ' Arg ^ ■ : ; 

T,2TT C MC ATG CAC TAC AAG" AGC AAC GGA ATG~ ATA GAC ■ 

. Gly. Ser Asn Asn Tyr Tyr -Met Kis-Tyr Lys SeV'Asn' Gly .Met' He Asp * 

AGT GTT CTG GAG AGT GCC AGA GAC ATC GGT ATA AAG GTC CTC AGA ATC TGG ■ 
Se. Val ..Leu Glu Ser ..Ala Arg Asp Met Gly lie Lys Val L =u ^ Ue Trp ' 

GGT TTC.CTC GAC GGG GAG AGT TAC TGC AGA GAC AAG AAC ACC TAC ATG CA- ' 
Gly Phe Leu Asp, Gly Glu Ser Tyr Cys Arg Asp Lys. Asp. Thr Tyr Met His 

OCT GAG CCC GOT' GTT TTC GGG GTG CCA GAA GGA ATA TCG AAC GCC CAG AGC. ' ' 



Pro Glu; Pro Gly Val ,, e Gly Val Pro Glu Gly lie Ser -Asn; Ala Gin 



Ser 



• .«! TTC GAA AGA CTC GAC TAC ACA GTT GCG AAA GCG AAA GAA CTC GGT ATA 
, Gly Phe Glu Arg Leu Asp Tyr Thr Val Ala Lys Ala Lys Glu Leu ; Gly lie 

, AAA CTT GTC ATT GTT CTT GTG AAC AAC TGG GAC GAC TTC GGT GGA ATG AAC 
Lys Leu val He ;Val Leu Val- Ash Asn Trp Asp. Asp Phe Gly Gly Met Asn 

CAG TAC GTG AGG TGG TTT GGA GGA ACC CAT CAC GAC GAT TTC TAC AGA GAT ' 
Gin Tyr Val.. Arg Trp Phe Gly Gly Thr His His, Asp Asp.'phe Tyr Arg ; Asp 

GAG AAG ATC AAA GAA GAG TAC AAA AAG TAC GTC TCC TTT CTC GTA AAC CAT 
Glu Lys He Lys Glu. Glu Tyr .Lys Lys Tyr Val Ser Phe Leu Val Asn' His. • 

GTCAATACCTACACG GGA GTT, CCT TACAGG GAA GAG CCC .ACC ATC ATG GCC 
val Asn. Thr Tyr Thr Gly Val Pro Tyr Arg Glu Glu' Pro Thr lie Met Ala 

TGG GAG CTT' GCA AAC GAA CCG CGC TGT GAG ACC GAC AAA TCG GGG AAC ACG 
Trp Glu Leu Ala Asn Glu Pro Arg Cys Glu .Thr Asp Lys Ser Gly Asn Thr 
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CTC GTT GAG -TOO GTG AAG GAG ATG AGC TCC t&o *™ 

— - « ^ „. * » « - » - ™ « « 
' " r s zz z z z z - - — r 

±r rne pne Ser Asn Tyr Glu Gly • 

TTC AAA CCT TAC GGT GGA GAA GCC GAG TGG rr- ™„ ' 

. „ ^ <jAl ' TGG GC>- TAC AAC GGC TGG TCP rrr 

Phe Lys Pro Tyr Gly Gly Glu A1 , TGG TCC GGT 

' Wa G1U Tr ? AIa Ty^r Asn Gly Trp Ser Gly . 

^^zzzzzzzzzzzz 

-^-zzzzzzzzzzzz ■ 
^^zzzzzzzzzzzz' 

-ZZZZZZZZZZZZZZZZ 

Z Z Z Z Z Z Z Z r W ^ TAC GAT m GGT GGA GAT . 

Asi; Asp Leu Val Tyr Asp Lett Gly Gly Asp 

" z z z r r cic aco cm mc 000 ° w «** *» « « -c 

"* "r"' "I" «y .« Asp 

" e ~ ~ ~ » r z oac ° gt ™ sm » « « - 

9 Gly lyr Tyr Pro .Asp Tyr „„ My phe ^ ^ ^ ^ . , . 

^zzzzzzzzzzzz— ' 

y v^-lu iy r Aia Lys Leu Phe Asn Thr Gly 

GAA GAC ATA AGA GAA GAC ACC TGC TCT TTC ATC CTT CCA m „ 
Glu Asp H e r ,„ h ' . < CC ^ AAA GAC GGC ATG 

P A., G.u „p Thr Cy , s „ „„. „, ^ ^ ^ ^ ^ 

'GAG ATC AAA AAG ACC GTG GAA GTG AGG-GCT GGT „ TTC GAC TAC AGC AAC 
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. , Glu He Lys Lys Thr. Val Glu Val Arg Ala Gly Val; Phe Asp Tyr W Asn 

ACG. TTT GAA AAG TTG TCT GTC AAA GTC GAA'GAT CTG GTt' -TTT GAA 'aAT GAG 

■ Thr Phe Glu Lys Xeu .Ser Val ;Lys Val Glu Asp Leu Val. Phe Glu Asn' Glu V 

ATA GAG CAT CTC GGA TAC GGA ATT TAC GGC TTT GAT CTC GAC ACA ACC CGG 
. Ile G - lu - His Leu Gly Ty'r Gly lie Tyr Gly Phe Asp". Leu.. Asp Thr Thr Arg ' 

: • ATC CC G : GAT- GGA GAA CAT GAA' ATG TTC CTT GAA GGC CAC TTT. CAg VgA AAA 
■ ' He Pro Asp Gly Glu . His Glu, Met Phe .Leu Glu. Gly His Phe Gin Gly Lys 

_ ACG .GTG AAA- GAC TCT ATC AAA GCG AAA GTG G7G AAC GAA' GCA CGG. TAC GTG 
• Thr Val Lys. Asp Ser lie Lys Ala . Lys Val Val -Asn Glu Ala Arg Tyr Val ". 

■ CTC GCA GAG GAA GTT GAT TTT TCC TCT. CCA GAA GAG GTG , AAA AAC 7G3 TGG 
Leu Ala .Glu Glu Val Asp- Phe Ser Ser Pro Glu, Glu Val Lys Asn Trp Trp 

", J AAC AGC GGA .A'CC.. TGG.' CAG GCA GAG TTC GGG TCA CCT ' GAC. ATT GAA- 7Gg''aAC ' ' 
. Asn Ser- Gly Thr Trp Gin' Ala Glu Phe Sly Ser Pro A* ? Ml e , Glu Trp' Asn 

-, . GGT GAG GTG GGA AAT' GGA GCA, CTG CAGjCTG AAC GTG AAA CTG. CCC GGA AAG. 
Gly Glu Val' Gly Asn Gly Ala; Leu" Gin ' Le.u Asri" ;/al- Lys Leu Pro Gly Lys ' ' 



, AGC GAC TGG. GAA. GAA GTG AGA GTA GCA AG3 AAG ' TTC GAA, AG A' CTC 
■ Ser Asp Trp Glu -Glu Val 'Arg Val Ala Arg Lys Phe- Glu. Arg Leu 



TCA GAA 
Ser Glu 



>■;■ TGT GAG ;; ATC/ CTC . GAG TAC >GAc" ATC' TAC ' ATT CCA AAC GTC GAG GGA CTC AAG ' 
.■ Cys, Glu He Leu Glu Tyr Asp'lle^Tyr- lie Pro Asn Val Glu Gly Leu Lys 

, GGA AGG TTG AGG CCG TAC GCG GTT. CTG AAC CCC GGC TGG GTG AAG ATA GGC 
. . Gly Arg, Leu Arg Pro Tyr Ala Val Leu Asn Pro Gly Trp Val. Lys lie Gly . 

■ CTC GAC ATG AAC AAC GCG AAC GTG. GAA AGT GCG GAG ATC ATC ACT TTC GGC ' 
f.'-'H U - A SP M et Asn. Asn : Ala Asn, Val .Glu Ser-. Ala Glu lie lie Thr Phe Gly 

/;. ' GGA AAA GAG TAC AGA AGA TTC CAT GTA AGA ATT GAG TTC GAC AGA ACA GCG 
Gly Lys, Glu Tyr Arg Arg' Phe His Val. Arg lie Glu. Phe- Asp- Arg Thr Ala 



Figure 15C(continued) 
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Z ^ T T ^ ^ m ^ — ™ ™ GAT 

c-iy vu Ly3 Giu Leu Hia Ile . Gly val vai Qiy ^ ^ ^ a « 

c s ne c r r gtg - aga ctt tat aga - - - - 

y I.. Phe lie Asp Asn- Val Arg Leu Tyr Lys' Arg Thr Gly «y Met 



TGA 

END . 



1991 



Figure 15a(continued) 
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6( 
' 2C 



40 



■ ' . Figure No. l^Thermocoga maritima MSB8(6gb4) 

1 ATG AAA- AGA ATC GAC CTG AAT GGT TTC TOG AGC GTT AGG GAT MC CM GGG AGA TTT TCG 
, l L y? Arg He Asp Leu A3 „ Gl y Ph, Trp.se, Val Ar 3 Asp a,„ Glu Gly £ Z'ur- 

?1 TTT GAA GGG ACT GTG CCA GGG GTT GTC CA3 GCA GAT CTG GTC AGA AAA GGT CI- CTT CCA 120 
« ;Phe Glu Ci y Thr Valero Gl y v a lv a l Gin Ala As P Leu Val Arg Ly s G l y £ ^ £ "° 

131 ,CAC-CCG TAC GTT GGG ATG AAC GAA GAT CTC TTC AAG GAA ATA GAA GAC AGA GAG TGG ATC 
•«J His Pro Tyr Val Gly Mec Asn 0 lu Asp Leu Phe L yS Glu n B Glu Asp Arg Giu Trp *™ 

181- . rAC.GAG- AGG GAG TTC GAG TTC AAA GAA* GAT GT3 AAA GAC}' GGG GAa' CGT GTC GAT CTC GT~ 
TyS ° 1U>r9 . G1U ' Phe G1U - We ^ =l-a'Asp Vai Lys clu oiy Glu- Arg V a l.» sp Leu Val " . 

Z J° ^ ACG CTG TC = "AT .GTT TAT CT, AAC GGT GTT TAC CTTGCA AGC ACC 

8, .Phe Glu- Gly Val Asp Thr Leu Ser. Asp Val 'Tyr "Leu Ash. Gl y Val Tyr Leu Gly Ser Thr 



421 GTC CTC GCC GGT CCT GAA GAT CCC ATC AGA GGA. TAC ATA AGA AAA GCC CAG TAT TCG TAC 
141 Val Leu Gly Giy Pr0 Glu Asp Prp . Iie krg Qly ^ ^ Lys , . ^ ^ ^ ^ ^ 



180 

' BO 



240 

80 



300 
ICO 



301 GAA GAC ATG TTC ATC GAG TAT CCC TTC -GAT GTC ACG AAC GTG TTG AAA GAA , AAG AAT CAC „, 
" G1U MECPhe 116 Glu ^ .*"*.«:■ Asp Val Thr a,.-, v.1 Leu ^Glu , ys , AS n His l 20 - 



■361 CT3 AAG GTG TAC A~A^ AAA TCT CCC ATC AGA GTT CCG AAA ACT CTC GAG CAG AAC TAC GGG . 420 
121 Leu Ly. v.1. Tyr;:i e Ly. Ser Pro' rie Arg v al Pro Lys T,r Leu Glu Gin Asn.Tyr Gl y ' 140 



480. 
ISO 



481 GGA TGG GAC .TGG 1 GGT GCC AGA ATC GTT ACAf AGC GGT ATT TGG AAA CCC GTC TAC CTC GAG '. 540 
Gly Trp Asp Trp Gly Ala Arc. lie Val Thr Ser oiy. lie Trp Ly. Pro Val Tyr^eu Glu 180 



600 
200 



S4! .GTG TAC AGG GCA CGT CTT CAG GAT TCA ACG GCT TAT -CTG TTG GAA CTT GAG GGG AAA GAT 

181 Val Tyr Arg Ala, Arg. Leu Gin Asp Ser Thr Ala' Tyr Leu Leu Glu Leu Glu Gly Lys' Asp " 

601 GCC CTT GTG AGG GTG AAC GGT TTC GTA CAC GGG GAA GGA AAT CTC ATT GTG 'GAA GTT TAT 660 

2P1 Ala Leu Val Arg .Val Asn oiy Phe Val His Gly Glu GlyAsn Leu ile y a i Glu'Val Tyr '220 

HI A ^ t "' Gt ^ GAA AAG. -ATA GGG GAG TTT CCT GTT CTT GAA AAG AAC GGA GAA AAG CTC 

' ^ Gly G1U LyS " e ^ Slu^e Pro Val Leu Glu Ly, Asn Gly Glu Lys Leu Phe 240 

241 Z r " C " C *** *** ^ TGG " AT CC ° TGG ™ GGG AAA CCG 780 

"1. Asp Gly val P he His Leu Lys Asp Val Lys Leu Trp Tyr Pro Trp Asn' Val Gly Lys Pro 26 0 
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114 0 

380 



■ --zzzzzzzzzzzzzzzzzz 

SOI TTC ATA TTC GAA ATC AAC GGT GAG AAA GT^ TTC rr™ 

- - - «. ar 01u - - - £ - - - - - , «. 
= --ZZZZZZZ-ZZZZZZZZ r 

"81 TAC AGA CTC TGT GAT GAA CTC GGT ATC A~G G'G T" . 
3- Tyr Arg Leu cys A3p ^ ; ^ - ^ ^ -C ATG TAC GCG TGT CT 

y i-e Met Val .r? G.n Asp Phe Met Tyr Ala Cys Is- 

1141 GAA TAT CCG GAT CAT CTT C~G ~GG T-" ' 
331 Glu Tyr Pro Aso His ^ p_ 0 * p ,"" ~ = *** 3AG AGA AAG ATT J20 0 

. 6 ^ ^ L6U Ai * Asn Giu «« Aia Ars Lys He . 40S . 

»01 GTG AGA AAA CTC AGA TAC CAT ■ CCC TCC ATT GTT C - ' 

401 vai : *■ * - ^ - - se, „. v J L c ;: £ ^ - r " c . 1260 

^ ^-ys ^ay Asn Asr. Glu Asn Asn 420 
1261 TGG GGA TTC GAT GAA TGG GGA AAT AT rr~ , ' -' : 

m - «* - - - *. «, - * ~ r £ s r r ac ~~ - 

* ^3 V*. ASp „ly T le ^ ^ 

"21 • AGG CTC TAC CTC TTC OA** T**- rr-r n*n _ 

- u. ^ « £ £ ~ ; - ~ - - ~ ~ « « « « „, 1 

uy S Ala W 1 U GiU Asp Pro Sfir p _ o ^ 

1381 TGG CCA TCC AGT CCA TAC GGC GGT GAA AAA -er »»r ~ 

115 - s - - - - — - £2 ~ s z is - r 

/ Asn Tyr Glu Lys Asp Thr Gly Arg 500 

-zzzzzzzzzzzzzzzzzz™ - ' 

olu Tnr I1 e Giu Phe Phe Ser 520 
"SI AAA CCC GAG GAA AGA GAG ATA TTC CAT CCC GTC a TP ' ' 

■» . 'v. » «. «. Ar9 ». n . Phe « « - » « « «, m „ l6!0 

iys his Asn Lys Gin Val Glu 540 
Figure 16b(continued) 
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. , . Arg Arg Phe Phe Ala G!u Val Leu Pro val ,eu L ys ,ys Arg Agp Asn . Lys Ile Glu W 

z zzz oT r r tct;gacj > sac am aga ;„ ^ TGaAGC rtA . .■ 1983 . 

, «-« vaa 01y Glu M9 Ser;Glu Gly Asp Lyg ^ Ser . Leu ^ ^ ^ ^ . ^ 

1981. CGA GAA OAA GOG AGA AAA GOT ATT CGA AAA GAC rAar '^r W «"1 ' 

«* — - - «: z z z %z>:% - 

2041 TGT GAG TTT GGT TGA ' 2055 ■ ' . ■ ' ' ; . ,~ »/. 

681. eye Glu Phe Gly End 685"'. ■ , ^' ' • ; * \ ' ^ *■ ;*./; 



Figure 166(continued) 



/ 
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Figure No. l*,Bankia gquldi (37gp 4 ) 
• 1 ATG AAA AAA AAT -CTA CTA ATG TTT AAA AGO CTT ACG TAT CTA CCT TTG ttt „ ■ ' 

uys His Gly Arg Leu Gin Val Asp 

- = - ~ - - - = = ^ = = r - - - - - - z - 

301 GGA AAT GGC TAT r — - 

36! GCA GCT ATT GCT AAC GGC ATA TAT GTA ATA ATA GAC TGCCAC ACT ~AC GAA n ' 

121 Ala Ala lie Ala A 9 n Gly n e Ty- Val lie il. » T ^ 

. ^ ^' Val Ua Ile As P Tr ? His Thr His Glu Al. Glu Leu 

m'£ : £ " z ™ r r ~ m °" ™ ~ - - - _ L 

A.„ lyr oi„ 01 „ M ^ w „. s „ ^ ^ ^ ^ ^ ^ 

» " « s z z ™ r ' r T " =ca m - c - -* - - *> •» 

«. Tyr ,„ r ,« u Kl , „,, „, ^ h ._ ^ ^ ^ ^ 

'« OTA OCA- CAC » C » oca' TO „, „, AAT GTT GCT TTC TTT G-r V. 

"1 v»l Al. oin Thr Al. l.„ .... , °" ™ 505 aCA »« 

Thr Ala i«„ a,„ 4 ,„ v „ „, 



60 

20 

120 
40 

180 
60 

; -240 
, 80 

300 

10.0 

350 
120 

420 
140 

480 
160 

'540 
180 



600 

Val 200 



660 
220 

720 
240 

780 
260 
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280 

'900 
300 



■lil ' L T r T T ^ ^ ^ ^ *** AGC ACT *** ^ TG<3 ATG GCC TTT TTG " .„ 

■■' ■ W C cT r T - ^ ™™ T " W ** GAC AAA GCT TTT CCT GAA ACA 

■ 901 GGG TCT .GTA GTT CAA GCA GGA CAA GGT GTA TCT GGT TTA ATT AGC AAT AAA CTT ACA GCC ' „„ 

1081 GGA GAT GAA ATT ATA ATT GCC CCT GGA AAC TAG AAT "»i .«■■« • .. 

Ml. Gly Asp-Glu I,, n. T1 ,v * C ATACA * GOT GCC 1140 

- . ? 16 Ali Pr °' AS " ^ **» .^e Oln Asp Ly, n, oin piy Ala . 3 ,o' 

11.1 TTT AAC CCT ACT CTT TAG CTT. TAT GGT AGT OCT AAC GGA AAC ACT ' ACA AAC -CC" A~~ A~A ' i 2 ~ 5 

"CI' TTA AGA GGC GAA AGC GCT ACA AAC CCT CCT GTT.TTC T~CA 

. U« Ar S Gly G,u Se, Ala ,,r Asn ? r= Pr0 Vai P he V oly Leu'As ? Tyr ^ Asn £ "0 



GGA TTA GAT TAT AAC AAT GGC 12€0' 
Gly Leu 'Asp 

- , , -- ™ OAT ATA GAG 

,421 .Tyr Le , Leu ser ne ^ Asp Ty? ?rp ^ ne ne ^ ^ . ^ 



^1 :Z "r*«y«^«T- W.«f 7 GG AAT ATT AAA GAT ATA GAG TTT AAA ACT 



GGG' " 13 2 0 



1321 TCT AAA 'GGT ATT -GTT CTT GAC AAT TCT AAT GGT AGT AAA TTA AAA AAC CTT GTT GTT OA" ^ 
«« S-r Lys Gly tl. vai ,Uu Asp Asn .,ar Aan Oly Sar Ly. ,eu ^ Asn t ;au V.l Z W ' . " 0 

1331. GAT ATT. GGA GAA GAA GCT ATT CAC TTG CGT GAT GGA TCT AGC AAT AAT AGT A^A GAT GGT ■ 1«0 
■ -1 Asp 'He Gly. Glu Glu Ala Xle. „ is Leu A , 9 Asp G!y Ser>er , sn A? „ S J ^ £ £ ; ^ 

-1 n.^ZTT^ A ^ AAA CCT GGT TTT GGT GAA GGT TTA TAT GTA GGC ' 1S00 . 

V3 lie Tyr Asn Thr Gly Arg Thr Lys Pro Gly Ph e Gly Glu. Gly Leu-Tyr Vai . Gly 



500 



; s ^ tit r gac Act ^ gaa ^ - - ^ - ^ ^ aac « 60 

Asp Lys Gly Gin His Asp Thr Tyr Glu Ar 9 Ala C ys Asn Asn Asn , Thr ,1. Glu Asn S20 

»! Z S S ot So a" T GCA " GAA — - - - ACA ATG AAC 1„0 

Oly P ro Asn v.l Thr Ala Glu Gly Vai Asp Vai Lys Glu Gly Thr Met Asn S<0 

Figure 17b(continued) 
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1621 ACT ATT ATA AGA AAT TGC GTG TTT TCT era m * 

zzzzzz zzzzzz:z 

! 166! OCT TTT ATT GAT TTA AAA GGA GCCTAT GOT TT- GTA ta,- » ' 

~ - - " - - - z ~ « ~ ~ z z z z z z f 

»' ;~ ~r z z z z zz zz z r r ~ » - - - » •••• 

31 Asp Phe Leu Asp Arg Gly Thr Gly Phe Asn Thr 600 
1801 GGT TTT AGA AAT 'OCA ATA TT n. .„ 

1851 TCA ACT GCT CGT AAA AAA CAA GGT TCT TT GAA rAA 

- —^——-zzzzzzzzzz l l" 

1321 AAC CCT AA T '"T C"T r>a^ ~r. 

"i ^ p ro Asn . s : r v ;, ° A ; ? :: !* a ; a agt:3a7 ^ «* *« ^ ^ ^ TTC lfl0 

-P »- P.o U. aer Asp Oly T,r Glu Asn- Leu,V Sl . AS n Ly, p a . S60 

•1981 TGC CCA GAT TGG AAT ATA GAA CCA TGT AAT — -TA r 

681 Cys Pro Asp Trp Asn - .. „.„ - "~ ACC ^ CAA =CA CCT *CA 2040 

^ irp.Asn ..e ^. ?ro Cy3 Asn pro . 

■ Asn dr. Ala Pro Thr 680 

2041 ATA AGC TTC CTA TT i-t r-™* „,.„ "■■•'- 

«i n, ser Ph "* ; r T : v 6 ; AC w att act ™ »* «* ™ ^ 2100 

V " ASn AS " Ile ?h -' * u Vai «« "V Tyr Asn Leu Gin val 70 o . 

2101 GAA GTT AAT GCT ACT OAT" CCA GAT GGA ACT ATT GAT AA- -a 

Glu Val Asn Ala Thr Asp v B _ „ *" A " ^ **" ° *** CTT ~^ ATA GAT AAC, 2160 
- A S? A-a Asp Gly Thr lie Asp Asn Val Lys Leu Tyr He Asp Asn 720 

21S1 AAT TTA GTT AGG CAA ATA AAT TCT ACT ~CA -AT AAA t~ „ 
»l As, Leu Val ^ Gin ue Asn Ser t" £ £ £ " " « ^ »" 

iyr Lys Trp „i y H is Ser Asp Ser Pro Asn 740 

--zzzzzzzzzzzzzzBzzzz 

z z z z z zz z z Z 2 z s z z zz z z z ' 

---zzzzzzzzzzzzzzzzzz z 

"01 AAG TTT TCT AAC GTT TTT GAG TTA GGA TCT GGC GGA CCA TCT TTA ACT AAT TA '' 

^ .v-i TTA AGT AAT TTA AAA ACA 2460 

Figure 17<t t (continued) 
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eo; ; . 1 ,> h ..s. C A.nv.x Ph9 = lu Giy Ser cly 01y Pro Ser , eu Ser Asn Leu . Lys Thr ^ 

AAC GGT GTA CCT GAT TAT TAT ATA AAT TTA AAA CCA AAA ATT AC" TTT Ctr rm 

. 841 Asn Gly Val Pro Asp Tyr Tvr II. » M , ' , ■ CAG TTT AAA AAT 2S80 

_ A-p Tyr. T yr He Asn Le «, Lys Pro Lys IleThr phe Gin Phe Lys Asn. e S0 

2561 GCA AAT CCA GAA ATA TCT ATT AGC AAT AGC' TTA ATT' CCT »ir t-r-r 

861 Ala Asn Pro Glu Ue Ser n, Ser Asn Zl 1 T ' * ^ ™ C ' TGG '*«*'» 

He Ser Asn Ser Leu He Pro Ash Phe Asp Gly Asp Tyr Trp 880 



2701 TTT AGT AAT GAC GCT ACT OCT CCT ATT TGT -AA* G~T AC" rr~r »~ ' ' 

• 'SOI P he .Ser.A 3 n Asp Ala Thr Ala: Pro Ile £ £ £ £ P " ^ ^ ^ ^ - 27S °" 

° .-e =ys Asn y ai Pro Ser Asn Gin lie Ser -Lys 920 

.."SI ATT ' ACT GAT GAT TCT AGT ATT AAT ..TTT AAC CTT TAC CCT ;v, ^ _ — » ^ '' - ' " ' - 

9 21 Ile'T h , Aap , As? , er ;, Ser ., ne . Asn T ,;_ - * " ^ ACT , 2820 

r i-cu i.,. ^ 0 Asn. Pro Aid Leu Asp- Glu Thr 940 - 

S p7 H ^ 6CT ^ ™ «r WA .' CCA GtSa7 , : • 

^ , Ile Xla « U *-T ^ l-eu. Ala te , Val Leu Val Pro * ss - • 



Figure- 17d(conciaued) 
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Figure No. Pyroccccua furiosus VC1(7EG1) 



leader sequence: amino acids 1-24 



9 18 27 36 4 « 

5 » ATG AGC AAa Ana sur? 34 



ATG A6C AAG AAA AAG TTC GTC ATC GTA *CT ATC TTA »r> », 

« - *. «. *. ». „ a . w; s : - ™ ^ «. » » - 



Ala lie Tvr Ph. v., „ _ _ _ ~ "»» TtA ACT TCA AAT 



OCA ATA TAT TTT GTA CAA AAG TAT CAT ACC TCT GAG GAC AAG TCA ACT TCA AAT 
Ma a. xyr Pne Va! oiu ,ys ryr His Thr Ser Glu Asp Lyg * £ £ £ 



"7 126 



135 144 



' "° 185 »• 207 

r z iz r r ogt m ™ cci « - « - « « «. « 

A-? Ty. Pro Asp As? G,v Glu Trp Pro Glv Ala ?~ Q -, e a ^ 

~ J *- e *-ys Asp Gly Asp 



225 234 243 ? = 

252 261 



. COG AAC CCA GAA TTC TAC ATT GAA ATA AAC CTA TGG AAC ATT CTT AAT G=~ ACT 
«y Asn Pro Glu Phe r/r ne Giu ne ^ ne ^ - ACT 

279 ' 28 3 * 297 305 ^ 

CGA TTT GCT GAG ATG ACG TAC AAT TTA ACC AGC GGC GTC CTT ^C TAC GTC ^ 

Gly phe ^ Glu Met * - - - - 01y val Leu Z Z Zl 2 

CAA CTT GAC AAC ATT « TTG AGG « AGA AGT 2 TGG GTG CAT GGA TAC CCC 
" U n = U« **** ■« As, Trp Val His Gly £ Pro 



387 39S 414 423 



GAA ATA TTC TAT GGA AAC AAG CCA TGG AAT GCA AAC TAC GCA ACT GAT GGC CCA 
■1« He Phe Tyr Cl y Asn ,ys Pr0 T,p Asn Ala Asn ^ Ala T hr a" £ Pro 

ATA CCA TTA CCC AGT aH CTT TCA 2 CTA ACA GAC TTC TAT CTA ACA ATC TCC 
XL L eu P„ Ser Ly8 val Ser Asn L eu T hr Asp Phe T. r Leu £ £ £ 
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TAT. AAA CTT G AG CCC AAG AAC G GC ^ CCA ATT MC TTC CCA ATA GAA TCC-TGG 
Tyr Ly-uu Glu.Pro L ys Asn Gly Leu Pro . Ile Asn Phe A la Ile Glu ser Trp 



545 



558 



S57 



TTA ACG AGA GAA GCT TGG AGA ACA ACA GGA ATT MC AGO GAT gIg CAA GAA GTA 
Leu Thr Ar g .Glu Ala Trp Ar g Thr Thr Gly „. ^ ^ ^ ^ ^ » ™ 



603 



612 



621 



, ATG ATA TGG ATT TAC TAT GAC GGA TTA CAA CCG GCT GGC TCC Z G ~ ^ ^ 

x; e T^ue.^ ^ A8P oly ieuGln . pro Ala . Giy se , h . av Z™,Z 

. ATT GTA Ore CCA ATA ATA GTT AAC GGA ACA CCA' GTA AAT GCT ACA- TTT GAA GTA 
XI, Val -Val p ro He He Val Asn Gly Thr Pro Val Asn Ala Thr Phe Glu Val 

711 720 729 MB' • 747 7S6 

TGG AAG GCA AAC ATT GGT TGG GAG TAT GTT GCA TTT AGA ATA AAG AC" CCA A-C 
Trp Lys Ala Asn lie Gly Trp Glu Tyr Val Ala Phe, Arg a.* Lys Thr Pro n * e . 

• 7S 5 774 783 792 8(T 1 

AAA GAG GGA ACA GTG ACA ATT CCA TAC GGA GCA TTT ATA AGT GTT GCA GCC lie ' 
■ uy. Glu Gly Thr Val Thr 'ile Pro Tyr Gly Ala Phe .He Ser Val Ala Ala Asn . 

' 819 , , 828 ■ 837 . •« ,/ 8SS • 8S 4 

ATT TCA AGC TTA CCA- AAT TAC ACA GAA CTT TAC TTA GAG GAC GTG GAG ATT GGA 
.'He ser Ser Leu Pro Asn Tyr Thr Glu' Leu Tyr Leu Glu Asp Val Glu Ile Gly 

873 . 882 891 900 ■' , 909 ' - 918 

ACT GAG TTT GGA ACG CCA AGC ACT ACC TCC GCC CAC CTA' GAG TGG TGG ATC ACA ' 
Thr Glu Phe Gly Thr Pro' Ser Thr Thr Ser' Ala His Leu Glu Trp Trp i> Tnr 



927 



936 



■ 945. , 954. 

AAC ATA ,ACA CTA ACT CCT CTA GAT AGA CCT CTT ATT . TCC TAA - 
Asrr Ile Thr Leu Thr Pro Leu Asp Arg : Pro Leu lie-. Ser * 



Figure lBfat continued) 



INTERNATIONAL SEARCH REPORT 


International application No. 
PCT/US97/22623 


A. CLASSIFICATION OF SUBJECT MATTER 
IPC(6) :C07H 21/04; C12N 1/20, 1/14, 5/00, 9/38, 9/42; C08B 30/04 
US CL :435/207, 209, 252.3, 254.11, 274, 275, 320.1, 325; 536/23.2 

Arrv>rHincr tn Int^mjttinnfll Pflt<*nt PlnwificAtinn (\PC\ or to both national classification and IPC 


R FIELDS SEARCHED 


Minimum documentation searched (classification system followed by classification symbols) 
U.S. : 435/207, 209, 252.3, 254.11, 274, 275, 320.1, 325; 536/23.2 


■ rtruii intent af ls\r* oj*o rv^h^A r\l K ^r- fkan minimum rlnpil m/»nf a fr inn tf> th^ f* ¥ I n f that Cllfh HoPltmAntQ fl fTG included IFl the fields SCAflchcd 

L/ocumcnLation se&rcnco outer man mininium oocuiijciiuiiiun vu uic cawhu uwi autu uu^uuiwiiLa ait uitiuuvu ut uit nuus avoiviibu 


Electronic data baae consulted during the international search (name of data base and, where practicable, search terms used) 
Please Sec Extra Sheet. 


C. DOCUMENTS CONSIDERED TO BE RELEVANT 


Category* 


Citation. of document, with indication, where appropriate, of the relevant passages 


Relevant to claim No. 


X 
A 


GRABNITZ et al. Structure of the p-Glucosidase Gene bglA of 
Clostridium thermocellum: Sequence Analysis Reveals a Superfamily 
of Cellulases and P-Glycosidases Including Human Lactase/Phlorizin 
Hydrolase. Eur. J. Biochem. September 1991, Vol. 200, Mb. 2, 
pages 301-309, see entire document. 


1-3, 5 
species EL 

* i'i , 

4, 6-11 ■ * 


X 
A 


VOORHORST et al. Characterization of the celB Gene Coding for 
p-Glucosidase from the Hyperthermophilic Archaeon Pyrococcus 
furiosus and Its Expression and Site-Directed Mutation in Escherichia 
coli. J. Bacteriol. December 1995, Vol. 177, No. 24, page;; 7105- 
7111, see entire document. 


1-3,5 

species I and HI 
4, 6-11 


|~| Further documents arc listed in the continuation of Box C. Sec patent family annex. 


* Special categories of cited document*: 

•A' document do fining the general staLe of the art which U not considered 
to be of particular relevance 

"B* earlier document published on or after the intarnationil filing date 

*L* document which may throw doubts on priority olatm(i) or which is 
ottad to establish the publication date of another citation or other 
tpeoial reason (as specified) 

"O* document referring to an oral disclosure, use, exhibition or other 
moons 

"P" document published prior to the international filing date but later than 
the priority date claimed 


•T" tater document published after the international filing date or priority 
data and not in conflict with the application but cited to understand 
the principle or theory underlying .the invention 

"X" document of particular relevance; the claimed invention cannot be 
considered novel or cannot be considered to involve an inventive step 
when the document is taken alone 

"Y* document of particular relevance: the claimed inventioo cannot be 
considered to involve an inventive step when the document is 
combined with one or more other such documents, such combination 
' being obvious to a parson skilled in the art 

'A* document member of the same patent family 


Date of the actual completion of the international search 
26 MARCH 1998 


Date of mailing of the international search report 

&1 APR 1998 


Name and mailing address of the ISA/US 
Commissioner of Paicnls and Trademarks 
Box PCT 

Washington, D.C. 20231 
Facsimile No. (703) 305-3230 


Authorized officer y — - 

LISA J. HOBBS, PH.D. . 

Telephone No. (703) 308-0196 ^ 





Form PCT/ISA/210 (second sheet)(July 1992)* 



INTERNATIONAL SEARCH REPORT 



International application No. 
. PCT/US97/22623 



Box [ Observations where certain claims were found unsearchable (Continuation of item I of first sheet) 



This international report has not been established in respect of certain claims . under Article 17(2) (a) for the following reasons; 

Claims Nos.: . . • 

because they relate to subject matter not required to be searched by this Authority, namely: 



| | Claims Nos.: 



2. 



Q ■ bc^^hcrrilfiic to paru of the international appUcaUon that do not comply with the prescribed requirements to such 
an extent that no meaningful international. search can be carried out, specifically: 



3. F| Claims Nos.: - > ■ ■ . . ' , A/ . 

M because they are dependent claims and are not drafted in accordance with the second and third sentences of Rule 6.4(a). 



Boa II Oboervadono where unity of Invention is lacking (Continuation of item 2. of first sheet) 



Thb International Searching Authority found multiple inventions in this international -application, as follows: 
Please See Extra Sheet. > V ' : 



1 . fl; : AsaU required additional search fees were timely paid by the applicant, this international search report covers all searchable 
claims. 



2. Q As all searchable claims could be searched without effort justifying an additional, fee, this Authority did not invite payment 
of any additional fee: 

3. fxj Ai only some of the required additional search fees were timely paid by the applicant, this international search report covers 
— only those claims for which fees were paid, specifically claims Nos.: \. . . 

1-11, species Mil . . 



4. f~l No required additional search feca were timely paid by the applicant. Consequently, this international search report is 
1 — 1 restricted to the invention first mentioned in the claims; it is covered by claims Nos. : 



Remark on Protest Q The additional search fees were accompanied by the applicant's protest. 

|~| No protest accompanied the payment of additional search fees. 
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INTERNATIONAL SEARCH REPORT 



International application No. 
PCT/US97/22623 



B. FIELDS SEARCHED . 
Electronic data bases consulted (Name of data base and where practicable terms used): 

APS and STN (Bioscience and Patent Indexes): Des ulfurococcW, Staphyloderma, Thermatoga, galactosidase^, 
glucosidase#, beta galactosidase^, beta glucosidase#. Genbank, EMBL, ESTsl-4, STS, N-Geneseq: Seq. iD Nos.: 1-3 
and A-Gencaeq, PIR, Swissprot: Seq ID Nos.: 15-17, 

BOX II. OBSERVATIONS WHERE UNITY OF INVENTION WAS LACKING ( 
This ISA found multiple inventions as follows: 

This application contains claims directed to more than one species of the generic invention. These species are deemed 
to lack Unity of Invention because they are not so linked as to form a single inventive concept under PCT Rule 13.1. 
The species are as follows: there are 18 distinct enzymes disclosed in the description, as enumerated in Figs. 1-18 and 
Table 1. , 

The claims arc deemed to correspond to the species listed above in the following manner: while all the claims form one 
Group for examination, each of the claims is generic to the 18 enzyme species disclosed. 



The speciea listed above do not relate to a single inventive concept under PCT Rule 13.1 because, under PCT Rule 
13.2, the species lack the same or corresponding special technical features for the following reasons: each enzyme is a 
different product, thus has the special technical feature of the recited enzyme, which the other species lack. 
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